Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 199523 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 2766 |
| Duplicate rows (%) | 1.4% |
| Total size in memory | 63.9 MiB |
| Average record size in memory | 336.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 32 |
| Dataset has 2766 (1.4%) duplicate rows | Duplicates |
state_of_previous_residence has a high cardinality: 51 distinct values | High cardinality |
num_persons_worked_for_employer is highly correlated with weeks_worked_in_year | High correlation |
weeks_worked_in_year is highly correlated with num_persons_worked_for_employer | High correlation |
num_persons_worked_for_employer is highly correlated with weeks_worked_in_year | High correlation |
weeks_worked_in_year is highly correlated with num_persons_worked_for_employer | High correlation |
age is highly correlated with wage_per_hour and 4 other fields | High correlation |
wage_per_hour is highly correlated with age and 4 other fields | High correlation |
capital_gains is highly correlated with age and 4 other fields | High correlation |
capital_losses is highly correlated with age and 4 other fields | High correlation |
divdends_from_stocks is highly correlated with age and 4 other fields | High correlation |
instance_weight is highly correlated with label | High correlation |
num_persons_worked_for_employer is highly correlated with label | High correlation |
weeks_worked_in_year is highly correlated with label | High correlation |
label is highly correlated with age and 7 other fields | High correlation |
race is highly correlated with country_of_birth_mother and 2 other fields | High correlation |
tax_filer_status is highly correlated with weeks_worked_in_year and 12 other fields | High correlation |
migration_code_change_in_msa is highly correlated with migration_code_move_within_reg and 7 other fields | High correlation |
migration_code_move_within_reg is highly correlated with migration_code_change_in_msa and 7 other fields | High correlation |
country_of_birth_mother is highly correlated with race and 4 other fields | High correlation |
weeks_worked_in_year is highly correlated with tax_filer_status and 10 other fields | High correlation |
member_of_a_labor_union is highly correlated with class_of_worker and 1 other fields | High correlation |
citizenship is highly correlated with country_of_birth_mother and 3 other fields | High correlation |
hispanic_Origin is highly correlated with country_of_birth_mother and 3 other fields | High correlation |
full_or_part_time_employment_stat is highly correlated with migration_code_change_in_msa and 6 other fields | High correlation |
region_of_previous_residence is highly correlated with migration_code_change_in_msa and 5 other fields | High correlation |
migration_code_change_in_reg is highly correlated with migration_code_change_in_msa and 7 other fields | High correlation |
detailed_household_summary_in_household is highly correlated with tax_filer_status and 7 other fields | High correlation |
live_in_this_house_1_year_ago is highly correlated with migration_code_change_in_msa and 7 other fields | High correlation |
year is highly correlated with migration_code_change_in_msa and 5 other fields | High correlation |
veterans_benefits is highly correlated with tax_filer_status and 13 other fields | High correlation |
education is highly correlated with tax_filer_status and 12 other fields | High correlation |
major_occupation_code is highly correlated with tax_filer_status and 11 other fields | High correlation |
state_of_previous_residence is highly correlated with migration_code_change_in_msa and 5 other fields | High correlation |
class_of_worker is highly correlated with tax_filer_status and 12 other fields | High correlation |
sex is highly correlated with detailed_household_and_family_stat | High correlation |
wage_per_hour is highly correlated with member_of_a_labor_union | High correlation |
num_persons_worked_for_employer is highly correlated with tax_filer_status and 10 other fields | High correlation |
migration_prev_res_in_sunbelt is highly correlated with migration_code_change_in_msa and 7 other fields | High correlation |
age is highly correlated with tax_filer_status and 14 other fields | High correlation |
country_of_birth_self is highly correlated with race and 4 other fields | High correlation |
major_industry_code is highly correlated with tax_filer_status and 12 other fields | High correlation |
industry_code is highly correlated with tax_filer_status and 9 other fields | High correlation |
detailed_household_and_family_stat is highly correlated with tax_filer_status and 12 other fields | High correlation |
family_members_under_18 is highly correlated with tax_filer_status and 11 other fields | High correlation |
marital_status is highly correlated with tax_filer_status and 6 other fields | High correlation |
reason_for_unemployment is highly correlated with class_of_worker | High correlation |
country_of_birth_father is highly correlated with race and 4 other fields | High correlation |
fill_inc_questionnaire_for_veterans_admin is highly correlated with veterans_benefits | High correlation |
occupation_code is highly correlated with weeks_worked_in_year and 8 other fields | High correlation |
enrolled_in_edu_inst_last_wk is highly correlated with education and 2 other fields | High correlation |
tax_filer_status is highly correlated with veterans_benefits and 1 other fields | High correlation |
migration_code_change_in_msa is highly correlated with migration_code_move_within_reg and 5 other fields | High correlation |
migration_code_move_within_reg is highly correlated with migration_code_change_in_msa and 6 other fields | High correlation |
country_of_birth_mother is highly correlated with citizenship and 3 other fields | High correlation |
citizenship is highly correlated with country_of_birth_mother and 2 other fields | High correlation |
hispanic_Origin is highly correlated with country_of_birth_mother and 1 other fields | High correlation |
full_or_part_time_employment_stat is highly correlated with live_in_this_house_1_year_ago and 1 other fields | High correlation |
migration_code_change_in_reg is highly correlated with migration_code_change_in_msa and 5 other fields | High correlation |
region_of_previous_residence is highly correlated with migration_code_change_in_msa and 5 other fields | High correlation |
detailed_household_summary_in_household is highly correlated with veterans_benefits and 2 other fields | High correlation |
live_in_this_house_1_year_ago is highly correlated with migration_code_change_in_msa and 7 other fields | High correlation |
year is highly correlated with migration_code_change_in_msa and 5 other fields | High correlation |
veterans_benefits is highly correlated with tax_filer_status and 5 other fields | High correlation |
education is highly correlated with veterans_benefits | High correlation |
major_occupation_code is highly correlated with major_industry_code | High correlation |
state_of_previous_residence is highly correlated with migration_code_move_within_reg and 3 other fields | High correlation |
migration_prev_res_in_sunbelt is highly correlated with migration_code_change_in_msa and 6 other fields | High correlation |
country_of_birth_self is highly correlated with country_of_birth_mother and 2 other fields | High correlation |
major_industry_code is highly correlated with major_occupation_code | High correlation |
detailed_household_and_family_stat is highly correlated with tax_filer_status and 3 other fields | High correlation |
family_members_under_18 is highly correlated with detailed_household_summary_in_household and 2 other fields | High correlation |
country_of_birth_father is highly correlated with country_of_birth_mother and 3 other fields | High correlation |
fill_inc_questionnaire_for_veterans_admin is highly correlated with veterans_benefits | High correlation |
divdends_from_stocks is highly skewed (γ1 = 27.78650179) | Skewed |
age has 2839 (1.4%) zeros | Zeros |
industry_code has 100684 (50.5%) zeros | Zeros |
occupation_code has 100684 (50.5%) zeros | Zeros |
wage_per_hour has 188219 (94.3%) zeros | Zeros |
capital_gains has 192144 (96.3%) zeros | Zeros |
capital_losses has 195617 (98.0%) zeros | Zeros |
divdends_from_stocks has 178382 (89.4%) zeros | Zeros |
num_persons_worked_for_employer has 95983 (48.1%) zeros | Zeros |
weeks_worked_in_year has 95983 (48.1%) zeros | Zeros |
Reproduction
| Analysis started | 2021-09-08 17:05:06.466431 |
|---|---|
| Analysis finished | 2021-09-08 17:23:18.238996 |
| Duration | 18 minutes and 11.77 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.49419866 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 2839 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 15 |
| median | 33 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 22.31089521 |
|---|---|
| Coefficient of variation (CV) | 0.6468013774 |
| Kurtosis | -0.7328243009 |
| Mean | 34.49419866 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.3732904573 |
| Sum | 6882386 |
| Variance | 497.7760449 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 3489 | 1.7% |
| 35 | 3450 | 1.7% |
| 36 | 3353 | 1.7% |
| 31 | 3351 | 1.7% |
| 33 | 3340 | 1.7% |
| 5 | 3332 | 1.7% |
| 4 | 3318 | 1.7% |
| 3 | 3279 | 1.6% |
| 37 | 3278 | 1.6% |
| 38 | 3277 | 1.6% |
| Other values (81) | 166056 |
| Value | Count | Frequency (%) |
| 0 | 2839 | |
| 1 | 3138 | |
| 2 | 3236 | |
| 3 | 3279 | |
| 4 | 3318 | |
| 5 | 3332 | |
| 6 | 3171 | |
| 7 | 3218 | |
| 8 | 3187 | |
| 9 | 3162 |
| Value | Count | Frequency (%) |
| 90 | 725 | |
| 89 | 195 | 0.1% |
| 88 | 241 | 0.1% |
| 87 | 301 | |
| 86 | 348 | |
| 85 | 423 | |
| 84 | 519 | |
| 83 | 561 | |
| 82 | 615 | |
| 81 | 720 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Private | |
| Self-employed-not incorporated | 8445 |
| Local government | 7784 |
| State government | 4227 |
| Other values (4) | 6794 |
Length
| Max length | 31 |
|---|---|
| Median length | 16 |
| Mean length | 14.02115546 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2797543 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Self-employed-not incorporated |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 100245 | |
| Private | 72028 | |
| Self-employed-not incorporated | 8445 | 4.2% |
| Local government | 7784 | 3.9% |
| State government | 4227 | 2.1% |
| Self-employed-incorporated | 3265 | 1.6% |
| Federal government | 2925 | 1.5% |
| Never worked | 439 | 0.2% |
| Without pay | 165 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 100245 | |
| in | 100245 | |
| universe | 100245 | |
| private | 72028 | |
| government | 14936 | 3.5% |
| self-employed-not | 8445 | 2.0% |
| incorporated | 8445 | 2.0% |
| local | 7784 | 1.8% |
| state | 4227 | 1.0% |
| self-employed-incorporated | 3265 | 0.8% |
| Other values (5) | 4133 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 423998 | ||
| e | 360624 | |
| i | 284393 | |
| n | 250517 | |
| t | 216148 | |
| r | 214432 | |
| v | 187648 | 6.7% |
| o | 167144 | 6.0% |
| N | 100684 | 3.6% |
| u | 100410 | 3.6% |
| Other values (19) | 491545 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2150602 | |
| Space Separator | 423998 | 15.2% |
| Uppercase Letter | 199523 | 7.1% |
| Dash Punctuation | 23420 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 360624 | |
| i | 284393 | |
| n | 250517 | |
| t | 216148 | |
| r | 214432 | |
| v | 187648 | |
| o | 167144 | |
| u | 100410 | 4.7% |
| s | 100245 | 4.7% |
| a | 98839 | 4.6% |
| Other values (11) | 170202 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 100684 | |
| P | 72028 | |
| S | 15937 | 8.0% |
| L | 7784 | 3.9% |
| F | 2925 | 1.5% |
| W | 165 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 423998 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23420 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2350125 | |
| Common | 447418 | 16.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 360624 | |
| i | 284393 | |
| n | 250517 | |
| t | 216148 | |
| r | 214432 | |
| v | 187648 | |
| o | 167144 | |
| N | 100684 | 4.3% |
| u | 100410 | 4.3% |
| s | 100245 | 4.3% |
| Other values (17) | 367880 |
Common
| Value | Count | Frequency (%) |
| 423998 | ||
| - | 23420 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2797543 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 423998 | ||
| e | 360624 | |
| i | 284393 | |
| n | 250517 | |
| t | 216148 | |
| r | 214432 | |
| v | 187648 | 6.7% |
| o | 167144 | 6.0% |
| N | 100684 | 3.6% |
| u | 100410 | 3.6% |
| Other values (19) | 491545 |
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.35232028 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 100684 |
| Zeros (%) | 50.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33 |
| 95-th percentile | 44 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 18.0671288 |
|---|---|
| Coefficient of variation (CV) | 1.17683376 |
| Kurtosis | -1.501107921 |
| Mean | 15.35232028 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5166876791 |
| Sum | 3063141 |
| Variance | 326.421143 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 100684 | |
| 33 | 17070 | 8.6% |
| 43 | 8283 | 4.2% |
| 4 | 5984 | 3.0% |
| 42 | 4683 | 2.3% |
| 45 | 4482 | 2.2% |
| 29 | 4209 | 2.1% |
| 37 | 4022 | 2.0% |
| 41 | 3964 | 2.0% |
| 32 | 3596 | 1.8% |
| Other values (42) | 42546 |
| Value | Count | Frequency (%) |
| 0 | 100684 | |
| 1 | 827 | 0.4% |
| 2 | 2196 | 1.1% |
| 3 | 563 | 0.3% |
| 4 | 5984 | 3.0% |
| 5 | 553 | 0.3% |
| 6 | 554 | 0.3% |
| 7 | 422 | 0.2% |
| 8 | 550 | 0.3% |
| 9 | 993 | 0.5% |
| Value | Count | Frequency (%) |
| 51 | 36 | < 0.1% |
| 50 | 1704 | 0.9% |
| 49 | 610 | 0.3% |
| 48 | 652 | 0.3% |
| 47 | 1644 | 0.8% |
| 46 | 187 | 0.1% |
| 45 | 4482 | |
| 44 | 2549 | 1.3% |
| 43 | 8283 | |
| 42 | 4683 |
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.30655614 |
| Minimum | 0 |
|---|---|
| Maximum | 46 |
| Zeros | 100684 |
| Zeros (%) | 50.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 26 |
| 95-th percentile | 38 |
| Maximum | 46 |
| Range | 46 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 14.45420392 |
|---|---|
| Coefficient of variation (CV) | 1.278391381 |
| Kurtosis | -0.8965333655 |
| Mean | 11.30655614 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.829238138 |
| Sum | 2255918 |
| Variance | 208.9240109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 100684 | |
| 2 | 8756 | 4.4% |
| 26 | 7887 | 4.0% |
| 19 | 5413 | 2.7% |
| 29 | 5105 | 2.6% |
| 36 | 4145 | 2.1% |
| 34 | 4025 | 2.0% |
| 10 | 3683 | 1.8% |
| 16 | 3445 | 1.7% |
| 23 | 3392 | 1.7% |
| Other values (37) | 52988 |
| Value | Count | Frequency (%) |
| 0 | 100684 | |
| 1 | 544 | 0.3% |
| 2 | 8756 | 4.4% |
| 3 | 3195 | 1.6% |
| 4 | 1364 | 0.7% |
| 5 | 855 | 0.4% |
| 6 | 441 | 0.2% |
| 7 | 731 | 0.4% |
| 8 | 2151 | 1.1% |
| 9 | 738 | 0.4% |
| Value | Count | Frequency (%) |
| 46 | 36 | < 0.1% |
| 45 | 172 | 0.1% |
| 44 | 1592 | |
| 43 | 1382 | |
| 42 | 1918 | |
| 41 | 1592 | |
| 40 | 617 | 0.3% |
| 39 | 1017 | 0.5% |
| 38 | 3003 | |
| 37 | 2234 |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| High school graduate | |
|---|---|
| Children | |
| Some college but no degree | |
| Bachelors degree(BA AB BS) | |
| 7th and 8th grade | |
| Other values (12) |
Length
| Max length | 39 |
|---|---|
| Median length | 21 |
| Mean length | 19.86398561 |
| Min length | 9 |
Characters and Unicode
| Total characters | 3963322 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High school graduate |
|---|---|
| 2nd row | Some college but no degree |
| 3rd row | 10th grade |
| 4th row | Children |
| 5th row | Children |
Common Values
| Value | Count | Frequency (%) |
| High school graduate | 48407 | |
| Children | 47422 | |
| Some college but no degree | 27820 | |
| Bachelors degree(BA AB BS) | 19865 | |
| 7th and 8th grade | 8007 | 4.0% |
| 10th grade | 7557 | 3.8% |
| 11th grade | 6876 | 3.4% |
| Masters degree(MA MS MEng MEd MSW MBA) | 6541 | 3.3% |
| 9th grade | 6230 | 3.1% |
| Associates degree-occup /vocational | 5358 | 2.7% |
| Other values (7) | 15440 | 7.7% |
Length
| Value | Count | Frequency (%) |
| school | 50200 | 8.2% |
| graduate | 48407 | 7.9% |
| high | 48407 | 7.9% |
| children | 47422 | 7.7% |
| grade | 36691 | 6.0% |
| no | 29946 | 4.9% |
| degree | 29613 | 4.8% |
| some | 27820 | 4.5% |
| college | 27820 | 4.5% |
| but | 27820 | 4.5% |
| Other values (42) | 239176 |
Most occurring characters
| Value | Count | Frequency (%) |
| 613322 | ||
| e | 459561 | 11.6% |
| o | 247530 | 6.2% |
| r | 244586 | 6.2% |
| g | 239232 | 6.0% |
| d | 225421 | 5.7% |
| h | 215132 | 5.4% |
| a | 205652 | 5.2% |
| l | 180611 | 4.6% |
| t | 150966 | 3.8% |
| Other values (37) | 1181309 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2803290 | |
| Space Separator | 613322 | 15.5% |
| Uppercase Letter | 402776 | 10.2% |
| Decimal Number | 69931 | 1.8% |
| Open Punctuation | 29462 | 0.7% |
| Close Punctuation | 29462 | 0.7% |
| Dash Punctuation | 9721 | 0.2% |
| Other Punctuation | 5358 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 459561 | |
| o | 247530 | |
| r | 244586 | |
| g | 239232 | |
| d | 225421 | |
| h | 215132 | |
| a | 205652 | |
| l | 180611 | 6.4% |
| t | 150966 | 5.4% |
| c | 133669 | 4.8% |
| Other values (9) | 500930 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 87794 | |
| S | 62560 | |
| A | 62533 | |
| M | 49373 | |
| H | 48407 | |
| C | 47422 | |
| E | 14345 | 3.6% |
| D | 12754 | 3.2% |
| W | 6541 | 1.6% |
| L | 4405 | 1.1% |
| Other values (3) | 6642 | 1.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 26053 | |
| 7 | 8007 | 11.4% |
| 8 | 8007 | 11.4% |
| 0 | 7557 | 10.8% |
| 9 | 6230 | 8.9% |
| 2 | 3925 | 5.6% |
| 5 | 3277 | 4.7% |
| 6 | 3277 | 4.7% |
| 3 | 1799 | 2.6% |
| 4 | 1799 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 613322 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 29462 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 29462 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9721 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 5358 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3206066 | |
| Common | 757256 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 459561 | |
| o | 247530 | 7.7% |
| r | 244586 | 7.6% |
| g | 239232 | 7.5% |
| d | 225421 | 7.0% |
| h | 215132 | 6.7% |
| a | 205652 | 6.4% |
| l | 180611 | 5.6% |
| t | 150966 | 4.7% |
| c | 133669 | 4.2% |
| Other values (22) | 903706 |
Common
| Value | Count | Frequency (%) |
| 613322 | ||
| ( | 29462 | 3.9% |
| ) | 29462 | 3.9% |
| 1 | 26053 | 3.4% |
| - | 9721 | 1.3% |
| 7 | 8007 | 1.1% |
| 8 | 8007 | 1.1% |
| 0 | 7557 | 1.0% |
| 9 | 6230 | 0.8% |
| / | 5358 | 0.7% |
| Other values (5) | 14077 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3963322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 613322 | ||
| e | 459561 | 11.6% |
| o | 247530 | 6.2% |
| r | 244586 | 6.2% |
| g | 239232 | 6.0% |
| d | 225421 | 5.7% |
| h | 215132 | 5.4% |
| a | 205652 | 5.2% |
| l | 180611 | 4.6% |
| t | 150966 | 3.8% |
| Other values (37) | 1181309 |
| Distinct | 1240 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.42690818 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 188219 |
| Zeros (%) | 94.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 495 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 274.8964539 |
|---|---|
| Coefficient of variation (CV) | 4.959620931 |
| Kurtosis | 155.2188969 |
| Mean | 55.42690818 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.935096531 |
| Sum | 11058943 |
| Variance | 75568.06037 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 188219 | |
| 500 | 734 | 0.4% |
| 600 | 546 | 0.3% |
| 700 | 534 | 0.3% |
| 800 | 507 | 0.3% |
| 1000 | 386 | 0.2% |
| 425 | 376 | 0.2% |
| 900 | 336 | 0.2% |
| 550 | 280 | 0.1% |
| 1200 | 256 | 0.1% |
| Other values (1230) | 7349 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 188219 | |
| 20 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 100 | 11 | < 0.1% |
| 110 | 1 | < 0.1% |
| 125 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| 143 | 1 | < 0.1% |
| 150 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 1 | < 0.1% |
| 9916 | 1 | < 0.1% |
| 9800 | 2 | |
| 9400 | 2 | |
| 9000 | 1 | < 0.1% |
| 8800 | 1 | < 0.1% |
| 8600 | 1 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8300 | 1 | < 0.1% |
| 8000 | 4 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| High school | 6892 |
| College or university | 5688 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 16.03287842 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3198928 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | High school |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 186943 | |
| High school | 6892 | 3.5% |
| College or university | 5688 | 2.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 186943 | |
| in | 186943 | |
| universe | 186943 | |
| high | 6892 | 1.2% |
| school | 6892 | 1.2% |
| college | 5688 | 1.0% |
| or | 5688 | 1.0% |
| university | 5688 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 591677 | ||
| i | 392154 | |
| e | 390950 | |
| n | 379574 | |
| o | 212103 | 6.6% |
| s | 199523 | 6.2% |
| r | 198319 | 6.2% |
| t | 192631 | 6.0% |
| u | 192631 | 6.0% |
| v | 192631 | 6.0% |
| Other values (8) | 256735 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2407728 | |
| Space Separator | 591677 | 18.5% |
| Uppercase Letter | 199523 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 392154 | |
| e | 390950 | |
| n | 379574 | |
| o | 212103 | |
| s | 199523 | |
| r | 198319 | |
| t | 192631 | |
| u | 192631 | |
| v | 192631 | |
| l | 18268 | 0.8% |
| Other values (4) | 38944 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 186943 | |
| H | 6892 | 3.5% |
| C | 5688 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 591677 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2607251 | |
| Common | 591677 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 392154 | |
| e | 390950 | |
| n | 379574 | |
| o | 212103 | |
| s | 199523 | |
| r | 198319 | |
| t | 192631 | |
| u | 192631 | |
| v | 192631 | |
| N | 186943 | |
| Other values (7) | 69792 | 2.7% |
Common
| Value | Count | Frequency (%) |
| 591677 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3198928 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 591677 | ||
| i | 392154 | |
| e | 390950 | |
| n | 379574 | |
| o | 212103 | 6.6% |
| s | 199523 | 6.2% |
| r | 198319 | 6.2% |
| t | 192631 | 6.0% |
| u | 192631 | 6.0% |
| v | 192631 | 6.0% |
| Other values (8) | 256735 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Never married | |
|---|---|
| Married-civilian spouse present | |
| Divorced | |
| Widowed | |
| Separated | 3460 |
| Other values (2) | 2183 |
Length
| Max length | 32 |
|---|---|
| Median length | 14 |
| Mean length | 20.99977947 |
| Min length | 8 |
Characters and Unicode
| Total characters | 4189939 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Widowed |
|---|---|
| 2nd row | Divorced |
| 3rd row | Never married |
| 4th row | Never married |
| 5th row | Never married |
Common Values
| Value | Count | Frequency (%) |
| Never married | 86485 | |
| Married-civilian spouse present | 84222 | |
| Divorced | 12710 | 6.4% |
| Widowed | 10463 | 5.2% |
| Separated | 3460 | 1.7% |
| Married-spouse absent | 1518 | 0.8% |
| Married-A F spouse present | 665 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| never | 86485 | |
| married | 86485 | |
| spouse | 84887 | |
| present | 84887 | |
| married-civilian | 84222 | |
| divorced | 12710 | 2.8% |
| widowed | 10463 | 2.3% |
| separated | 3460 | 0.8% |
| married-spouse | 1518 | 0.3% |
| absent | 1518 | 0.3% |
| Other values (2) | 1330 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 633650 | |
| r | 533322 | |
| 457965 | ||
| i | 448729 | |
| a | 265550 | 6.3% |
| s | 259215 | 6.2% |
| d | 209986 | 5.0% |
| v | 183417 | 4.4% |
| p | 174752 | 4.2% |
| n | 170627 | 4.1% |
| Other values (16) | 852726 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3444716 | |
| Space Separator | 457965 | 10.9% |
| Uppercase Letter | 200853 | 4.8% |
| Dash Punctuation | 86405 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 633650 | |
| r | 533322 | |
| i | 448729 | |
| a | 265550 | |
| s | 259215 | |
| d | 209986 | 6.1% |
| v | 183417 | 5.3% |
| p | 174752 | 5.1% |
| n | 170627 | 5.0% |
| o | 109578 | 3.2% |
| Other values (7) | 455890 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 86485 | |
| M | 86405 | |
| D | 12710 | 6.3% |
| W | 10463 | 5.2% |
| S | 3460 | 1.7% |
| A | 665 | 0.3% |
| F | 665 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 457965 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3645569 | |
| Common | 544370 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 633650 | |
| r | 533322 | |
| i | 448729 | |
| a | 265550 | |
| s | 259215 | 7.1% |
| d | 209986 | 5.8% |
| v | 183417 | 5.0% |
| p | 174752 | 4.8% |
| n | 170627 | 4.7% |
| o | 109578 | 3.0% |
| Other values (14) | 656743 |
Common
| Value | Count | Frequency (%) |
| 457965 | ||
| - | 86405 | 15.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4189939 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 633650 | |
| r | 533322 | |
| 457965 | ||
| i | 448729 | |
| a | 265550 | 6.3% |
| s | 259215 | 6.2% |
| d | 209986 | 5.0% |
| v | 183417 | 4.4% |
| p | 174752 | 4.2% |
| n | 170627 | 4.1% |
| Other values (16) | 852726 |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe or children | |
|---|---|
| Retail trade | |
| Manufacturing-durable goods | 9015 |
| Education | 8283 |
| Manufacturing-nondurable goods | 6897 |
| Other values (19) |
Length
| Max length | 36 |
|---|---|
| Median length | 28 |
| Mean length | 24.39614982 |
| Min length | 7 |
Characters and Unicode
| Total characters | 4867593 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe or children |
|---|---|
| 2nd row | Construction |
| 3rd row | Not in universe or children |
| 4th row | Not in universe or children |
| 5th row | Not in universe or children |
Common Values
| Value | Count | Frequency (%) |
| Not in universe or children | 100684 | |
| Retail trade | 17070 | 8.6% |
| Manufacturing-durable goods | 9015 | 4.5% |
| Education | 8283 | 4.2% |
| Manufacturing-nondurable goods | 6897 | 3.5% |
| Finance insurance and real estate | 6145 | 3.1% |
| Construction | 5984 | 3.0% |
| Business and repair services | 5651 | 2.8% |
| Medical except hospital | 4683 | 2.3% |
| Public administration | 4610 | 2.3% |
| Other values (14) | 30501 | 15.3% |
Length
| Value | Count | Frequency (%) |
| not | 100684 | |
| universe | 100684 | |
| or | 100684 | |
| children | 100684 | |
| in | 100684 | |
| services | 21706 | 3.0% |
| trade | 20666 | 2.8% |
| retail | 17070 | 2.3% |
| goods | 15912 | 2.2% |
| and | 13161 | 1.8% |
| Other values (34) | 135470 |
Most occurring characters
| Value | Count | Frequency (%) |
| 727405 | ||
| e | 493118 | |
| i | 454739 | 9.3% |
| n | 445989 | 9.2% |
| r | 444143 | 9.1% |
| o | 304536 | 6.3% |
| t | 242020 | 5.0% |
| s | 233277 | 4.8% |
| a | 190749 | 3.9% |
| c | 188561 | 3.9% |
| Other values (28) | 1143056 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3918843 | |
| Space Separator | 727405 | 14.9% |
| Uppercase Letter | 205433 | 4.2% |
| Dash Punctuation | 15912 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 493118 | |
| i | 454739 | |
| n | 445989 | |
| r | 444143 | |
| o | 304536 | |
| t | 242020 | 6.2% |
| s | 233277 | 6.0% |
| a | 190749 | 4.9% |
| c | 188561 | 4.8% |
| u | 187265 | 4.8% |
| Other values (11) | 734446 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 100684 | |
| M | 21158 | 10.3% |
| R | 17070 | 8.3% |
| E | 9934 | 4.8% |
| H | 9838 | 4.8% |
| P | 8492 | 4.1% |
| C | 7165 | 3.5% |
| F | 6368 | 3.1% |
| B | 5651 | 2.8% |
| O | 4482 | 2.2% |
| Other values (5) | 14591 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 727405 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15912 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4124276 | |
| Common | 743317 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 493118 | |
| i | 454739 | |
| n | 445989 | |
| r | 444143 | |
| o | 304536 | 7.4% |
| t | 242020 | 5.9% |
| s | 233277 | 5.7% |
| a | 190749 | 4.6% |
| c | 188561 | 4.6% |
| u | 187265 | 4.5% |
| Other values (26) | 939879 |
Common
| Value | Count | Frequency (%) |
| 727405 | ||
| - | 15912 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4867593 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 727405 | ||
| e | 493118 | |
| i | 454739 | 9.3% |
| n | 445989 | 9.2% |
| r | 444143 | 9.1% |
| o | 304536 | 6.3% |
| t | 242020 | 5.0% |
| s | 233277 | 4.8% |
| a | 190749 | 3.9% |
| c | 188561 | 3.9% |
| Other values (28) | 1143056 |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Adm support including clerical | |
| Professional specialty | |
| Executive admin and managerial | |
| Other service | |
| Other values (10) |
Length
| Max length | 38 |
|---|---|
| Median length | 16 |
| Mean length | 20.76417756 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4142931 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Precision production craft & repair |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 100684 | |
| Adm support including clerical | 14837 | 7.4% |
| Professional specialty | 13940 | 7.0% |
| Executive admin and managerial | 12495 | 6.3% |
| Other service | 12099 | 6.1% |
| Sales | 11783 | 5.9% |
| Precision production craft & repair | 10518 | 5.3% |
| Machine operators assmblrs & inspctrs | 6379 | 3.2% |
| Handlers equip cleaners etc | 4127 | 2.1% |
| Transportation and material moving | 4020 | 2.0% |
| Other values (5) | 8641 | 4.3% |
Length
| Value | Count | Frequency (%) |
| not | 100684 | |
| universe | 100684 | |
| in | 100684 | |
| and | 22679 | 3.6% |
| support | 17855 | 2.9% |
| 16897 | 2.7% | |
| including | 14837 | 2.4% |
| adm | 14837 | 2.4% |
| clerical | 14837 | 2.4% |
| specialty | 13940 | 2.2% |
| Other values (33) | 204770 |
Most occurring characters
| Value | Count | Frequency (%) |
| 626831 | ||
| i | 414716 | |
| e | 410135 | |
| n | 359087 | 8.7% |
| r | 299839 | 7.2% |
| s | 260315 | 6.3% |
| t | 217320 | 5.2% |
| o | 209194 | 5.0% |
| a | 201628 | 4.9% |
| u | 161296 | 3.9% |
| Other values (24) | 982570 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3299644 | |
| Space Separator | 626831 | 15.1% |
| Uppercase Letter | 199559 | 4.8% |
| Other Punctuation | 16897 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 414716 | |
| e | 410135 | |
| n | 359087 | |
| r | 299839 | |
| s | 260315 | |
| t | 217320 | 6.6% |
| o | 209194 | 6.3% |
| a | 201628 | 6.1% |
| u | 161296 | 4.9% |
| c | 145785 | 4.4% |
| Other values (12) | 620329 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 100684 | |
| P | 26899 | 13.5% |
| A | 14873 | 7.5% |
| E | 12495 | 6.3% |
| O | 12099 | 6.1% |
| S | 11783 | 5.9% |
| T | 7038 | 3.5% |
| M | 6379 | 3.2% |
| H | 4127 | 2.1% |
| F | 3182 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 626831 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 16897 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3499203 | |
| Common | 643728 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 414716 | |
| e | 410135 | |
| n | 359087 | |
| r | 299839 | 8.6% |
| s | 260315 | 7.4% |
| t | 217320 | 6.2% |
| o | 209194 | 6.0% |
| a | 201628 | 5.8% |
| u | 161296 | 4.6% |
| c | 145785 | 4.2% |
| Other values (22) | 819888 |
Common
| Value | Count | Frequency (%) |
| 626831 | ||
| & | 16897 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4142931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 626831 | ||
| i | 414716 | |
| e | 410135 | |
| n | 359087 | 8.7% |
| r | 299839 | 7.2% |
| s | 260315 | 6.3% |
| t | 217320 | 5.2% |
| o | 209194 | 5.0% |
| a | 201628 | 4.9% |
| u | 161296 | 3.9% |
| Other values (24) | 982570 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| White | |
|---|---|
| Black | |
| Asian or Pacific Islander | 5835 |
| Other | 3657 |
| Amer Indian Aleut or Eskimo | 2251 |
Length
| Max length | 28 |
|---|---|
| Median length | 6 |
| Mean length | 6.833096936 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1363360 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | Asian or Pacific Islander |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 167365 | |
| Black | 20415 | 10.2% |
| Asian or Pacific Islander | 5835 | 2.9% |
| Other | 3657 | 1.8% |
| Amer Indian Aleut or Eskimo | 2251 | 1.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| white | 167365 | |
| black | 20415 | 9.0% |
| or | 8086 | 3.6% |
| asian | 5835 | 2.6% |
| pacific | 5835 | 2.6% |
| islander | 5835 | 2.6% |
| other | 3657 | 1.6% |
| amer | 2251 | 1.0% |
| indian | 2251 | 1.0% |
| aleut | 2251 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 226032 | ||
| i | 189372 | |
| e | 181359 | |
| t | 173273 | |
| h | 171022 | |
| W | 167365 | |
| a | 40171 | 2.9% |
| c | 32085 | 2.4% |
| l | 28501 | 2.1% |
| k | 22666 | 1.7% |
| Other values (14) | 131514 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 919382 | |
| Space Separator | 226032 | 16.6% |
| Uppercase Letter | 217946 | 16.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 189372 | |
| e | 181359 | |
| t | 173273 | |
| h | 171022 | |
| a | 40171 | 4.4% |
| c | 32085 | 3.5% |
| l | 28501 | 3.1% |
| k | 22666 | 2.5% |
| r | 19829 | 2.2% |
| n | 16172 | 1.8% |
| Other values (6) | 44932 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 167365 | |
| B | 20415 | 9.4% |
| A | 10337 | 4.7% |
| I | 8086 | 3.7% |
| P | 5835 | 2.7% |
| O | 3657 | 1.7% |
| E | 2251 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 226032 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1137328 | |
| Common | 226032 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 189372 | |
| e | 181359 | |
| t | 173273 | |
| h | 171022 | |
| W | 167365 | |
| a | 40171 | 3.5% |
| c | 32085 | 2.8% |
| l | 28501 | 2.5% |
| k | 22666 | 2.0% |
| B | 20415 | 1.8% |
| Other values (13) | 111099 |
Common
| Value | Count | Frequency (%) |
| 226032 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1363360 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 226032 | ||
| i | 189372 | |
| e | 181359 | |
| t | 173273 | |
| h | 171022 | |
| W | 167365 | |
| a | 40171 | 2.9% |
| c | 32085 | 2.4% |
| l | 28501 | 2.1% |
| k | 22666 | 1.7% |
| Other values (14) | 131514 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| All other | |
|---|---|
| Mexican-American | 8079 |
| Mexican (Mexicano) | 7234 |
| Central or South American | 3895 |
| Puerto Rican | 3313 |
| Other values (5) | 5095 |
Length
| Max length | 26 |
|---|---|
| Median length | 10 |
| Mean length | 10.9685099 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2188470 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All other |
|---|---|
| 2nd row | All other |
| 3rd row | All other |
| 4th row | All other |
| 5th row | All other |
Common Values
| Value | Count | Frequency (%) |
| All other | 171907 | |
| Mexican-American | 8079 | 4.0% |
| Mexican (Mexicano) | 7234 | 3.6% |
| Central or South American | 3895 | 2.0% |
| Puerto Rican | 3313 | 1.7% |
| Other Spanish | 2485 | 1.2% |
| Cuban | 1126 | 0.6% |
| NA | 874 | 0.4% |
| Do not know | 306 | 0.2% |
| Chicano | 304 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| other | 174392 | |
| all | 171907 | |
| mexican-american | 8079 | 2.0% |
| mexicano | 7234 | 1.8% |
| mexican | 7234 | 1.8% |
| central | 3895 | 1.0% |
| or | 3895 | 1.0% |
| south | 3895 | 1.0% |
| american | 3895 | 1.0% |
| puerto | 3313 | 0.8% |
| Other values (8) | 9020 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 396759 | ||
| l | 347709 | |
| e | 216121 | |
| r | 197469 | |
| o | 191466 | |
| t | 185801 | |
| A | 184755 | |
| h | 181076 | |
| n | 46256 | 2.1% |
| a | 45644 | 2.1% |
| Other values (21) | 195414 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1539866 | |
| Space Separator | 396759 | 18.1% |
| Uppercase Letter | 229298 | 10.5% |
| Dash Punctuation | 8079 | 0.4% |
| Open Punctuation | 7234 | 0.3% |
| Close Punctuation | 7234 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 347709 | |
| e | 216121 | |
| r | 197469 | |
| o | 191466 | |
| t | 185801 | |
| h | 181076 | |
| n | 46256 | 3.0% |
| a | 45644 | 3.0% |
| i | 40623 | 2.6% |
| c | 38138 | 2.5% |
| Other values (8) | 49563 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 184755 | |
| M | 22547 | 9.8% |
| S | 6380 | 2.8% |
| C | 5325 | 2.3% |
| P | 3313 | 1.4% |
| R | 3313 | 1.4% |
| O | 2485 | 1.1% |
| N | 874 | 0.4% |
| D | 306 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 396759 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7234 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7234 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8079 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1769164 | |
| Common | 419306 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 347709 | |
| e | 216121 | |
| r | 197469 | |
| o | 191466 | |
| t | 185801 | |
| A | 184755 | |
| h | 181076 | |
| n | 46256 | 2.6% |
| a | 45644 | 2.6% |
| i | 40623 | 2.3% |
| Other values (17) | 132244 | 7.5% |
Common
| Value | Count | Frequency (%) |
| 396759 | ||
| - | 8079 | 1.9% |
| ( | 7234 | 1.7% |
| ) | 7234 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2188470 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 396759 | ||
| l | 347709 | |
| e | 216121 | |
| r | 197469 | |
| o | 191466 | |
| t | 185801 | |
| A | 184755 | |
| h | 181076 | |
| n | 46256 | 2.1% |
| a | 45644 | 2.1% |
| Other values (21) | 195414 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.042325947 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1205583 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 103984 | |
| Male | 95539 |
Length
Pie chart
| Value | Count | Frequency (%) |
| female | 103984 | |
| male | 95539 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 303507 | |
| 199523 | ||
| a | 199523 | |
| l | 199523 | |
| F | 103984 | 8.6% |
| m | 103984 | 8.6% |
| M | 95539 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 806537 | |
| Space Separator | 199523 | 16.5% |
| Uppercase Letter | 199523 | 16.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 303507 | |
| a | 199523 | |
| l | 199523 | |
| m | 103984 | 12.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 103984 | |
| M | 95539 |
Space Separator
| Value | Count | Frequency (%) |
| 199523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1006060 | |
| Common | 199523 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 303507 | |
| a | 199523 | |
| l | 199523 | |
| F | 103984 | 10.3% |
| m | 103984 | 10.3% |
| M | 95539 | 9.5% |
Common
| Value | Count | Frequency (%) |
| 199523 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1205583 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 303507 | |
| 199523 | ||
| a | 199523 | |
| l | 199523 | |
| F | 103984 | 8.6% |
| m | 103984 | 8.6% |
| M | 95539 | 7.9% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| No | 16034 |
| Yes | 3030 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.77306376 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2947566 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 180459 | |
| No | 16034 | 8.0% |
| Yes | 3030 | 1.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 180459 | |
| in | 180459 | |
| universe | 180459 | |
| no | 16034 | 2.9% |
| yes | 3030 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 560441 | ||
| e | 363948 | |
| i | 360918 | |
| n | 360918 | |
| N | 196493 | 6.7% |
| o | 196493 | 6.7% |
| s | 183489 | 6.2% |
| t | 180459 | 6.1% |
| u | 180459 | 6.1% |
| v | 180459 | 6.1% |
| Other values (2) | 183489 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2187602 | |
| Space Separator | 560441 | 19.0% |
| Uppercase Letter | 199523 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 363948 | |
| i | 360918 | |
| n | 360918 | |
| o | 196493 | |
| s | 183489 | |
| t | 180459 | |
| u | 180459 | |
| v | 180459 | |
| r | 180459 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 196493 | |
| Y | 3030 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 560441 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2387125 | |
| Common | 560441 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 363948 | |
| i | 360918 | |
| n | 360918 | |
| N | 196493 | |
| o | 196493 | |
| s | 183489 | |
| t | 180459 | |
| u | 180459 | |
| v | 180459 | |
| r | 180459 |
Common
| Value | Count | Frequency (%) |
| 560441 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2947566 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 560441 | ||
| e | 363948 | |
| i | 360918 | |
| n | 360918 | |
| N | 196493 | 6.7% |
| o | 196493 | 6.7% |
| s | 183489 | 6.2% |
| t | 180459 | 6.1% |
| u | 180459 | 6.1% |
| v | 180459 | 6.1% |
| Other values (2) | 183489 | 6.2% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Other job loser | 2038 |
| Re-entrant | 2019 |
| Job loser - on layoff | 976 |
| Job leaver | 598 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 15.9549676 |
| Min length | 11 |
Characters and Unicode
| Total characters | 3183383 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 193453 | |
| Other job loser | 2038 | 1.0% |
| Re-entrant | 2019 | 1.0% |
| Job loser - on layoff | 976 | 0.5% |
| Job leaver | 598 | 0.3% |
| New entrant | 439 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 193453 | |
| in | 193453 | |
| universe | 193453 | |
| job | 3612 | 0.6% |
| loser | 3014 | 0.5% |
| other | 2038 | 0.3% |
| re-entrant | 2019 | 0.3% |
| 976 | 0.2% | |
| on | 976 | 0.2% |
| layoff | 976 | 0.2% |
| Other values (3) | 1476 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 595446 | ||
| e | 398070 | |
| n | 392798 | |
| i | 386906 | |
| o | 202031 | 6.3% |
| r | 201561 | 6.3% |
| t | 200407 | 6.3% |
| s | 196467 | 6.2% |
| v | 194051 | 6.1% |
| N | 193892 | 6.1% |
| Other values (13) | 221754 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2385419 | |
| Space Separator | 595446 | 18.7% |
| Uppercase Letter | 199523 | 6.3% |
| Dash Punctuation | 2995 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 398070 | |
| n | 392798 | |
| i | 386906 | |
| o | 202031 | |
| r | 201561 | |
| t | 200407 | |
| s | 196467 | |
| v | 194051 | |
| u | 193453 | |
| l | 4588 | 0.2% |
| Other values (7) | 15087 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 193892 | |
| O | 2038 | 1.0% |
| R | 2019 | 1.0% |
| J | 1574 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 595446 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2995 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2584942 | |
| Common | 598441 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 398070 | |
| n | 392798 | |
| i | 386906 | |
| o | 202031 | |
| r | 201561 | |
| t | 200407 | |
| s | 196467 | |
| v | 194051 | |
| N | 193892 | |
| u | 193453 | |
| Other values (11) | 25306 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 595446 | ||
| - | 2995 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3183383 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 595446 | ||
| e | 398070 | |
| n | 392798 | |
| i | 386906 | |
| o | 202031 | 6.3% |
| r | 201561 | 6.3% |
| t | 200407 | 6.3% |
| s | 196467 | 6.2% |
| v | 194051 | 6.1% |
| N | 193892 | 6.1% |
| Other values (13) | 221754 | 7.0% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Children or Armed Forces | |
|---|---|
| Full-time schedules | |
| Not in labor force | |
| PT for non-econ reasons usually FT | 3322 |
| Unemployed full-time | 2311 |
| Other values (3) | 2577 |
Length
| Max length | 35 |
|---|---|
| Median length | 25 |
| Mean length | 23.33263834 |
| Min length | 19 |
Characters and Unicode
| Total characters | 4655398 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in labor force |
|---|---|
| 2nd row | Children or Armed Forces |
| 3rd row | Not in labor force |
| 4th row | Children or Armed Forces |
| 5th row | Children or Armed Forces |
Common Values
| Value | Count | Frequency (%) |
| Children or Armed Forces | 123769 | |
| Full-time schedules | 40736 | 20.4% |
| Not in labor force | 26808 | 13.4% |
| PT for non-econ reasons usually FT | 3322 | 1.7% |
| Unemployed full-time | 2311 | 1.2% |
| PT for econ reasons usually PT | 1209 | 0.6% |
| Unemployed part- time | 843 | 0.4% |
| PT for econ reasons usually FT | 525 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| children | 123769 | |
| or | 123769 | |
| armed | 123769 | |
| forces | 123769 | |
| full-time | 43047 | 6.0% |
| schedules | 40736 | 5.6% |
| not | 26808 | 3.7% |
| labor | 26808 | 3.7% |
| force | 26808 | 3.7% |
| in | 26808 | 3.7% |
| Other values (10) | 35176 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 721267 | ||
| r | 559647 | |
| e | 539897 | |
| o | 349606 | 7.5% |
| d | 291428 | 6.3% |
| l | 290673 | 6.2% |
| s | 220409 | 4.7% |
| c | 196369 | 4.2% |
| i | 194467 | 4.2% |
| m | 170813 | 3.7% |
| Other values (17) | 1120822 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3424690 | |
| Space Separator | 721267 | 15.5% |
| Uppercase Letter | 462229 | 9.9% |
| Dash Punctuation | 47212 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 559647 | |
| e | 539897 | |
| o | 349606 | |
| d | 291428 | |
| l | 290673 | |
| s | 220409 | 6.4% |
| c | 196369 | 5.7% |
| i | 194467 | 5.7% |
| m | 170813 | 5.0% |
| n | 170487 | 5.0% |
| Other values (8) | 440894 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 168352 | |
| C | 123769 | |
| A | 123769 | |
| N | 26808 | 5.8% |
| T | 10112 | 2.2% |
| P | 6265 | 1.4% |
| U | 3154 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 721267 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3886919 | |
| Common | 768479 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 559647 | |
| e | 539897 | |
| o | 349606 | 9.0% |
| d | 291428 | 7.5% |
| l | 290673 | 7.5% |
| s | 220409 | 5.7% |
| c | 196369 | 5.1% |
| i | 194467 | 5.0% |
| m | 170813 | 4.4% |
| n | 170487 | 4.4% |
| Other values (15) | 903123 |
Common
| Value | Count | Frequency (%) |
| 721267 | ||
| - | 47212 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4655398 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 721267 | ||
| r | 559647 | |
| e | 539897 | |
| o | 349606 | 7.5% |
| d | 291428 | 6.3% |
| l | 290673 | 6.2% |
| s | 220409 | 4.7% |
| c | 196369 | 4.2% |
| i | 194467 | 4.2% |
| m | 170813 | 3.7% |
| Other values (17) | 1120822 |
| Distinct | 132 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 434.7189898 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 192144 |
| Zeros (%) | 96.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4697.53128 |
|---|---|
| Coefficient of variation (CV) | 10.8059031 |
| Kurtosis | 393.0628325 |
| Mean | 434.7189898 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.99082234 |
| Sum | 86736437 |
| Variance | 22066800.12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 192144 | |
| 15024 | 788 | 0.4% |
| 7688 | 609 | 0.3% |
| 7298 | 582 | 0.3% |
| 99999 | 390 | 0.2% |
| 3103 | 237 | 0.1% |
| 5178 | 207 | 0.1% |
| 5013 | 158 | 0.1% |
| 4386 | 151 | 0.1% |
| 3325 | 121 | 0.1% |
| Other values (122) | 4136 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 192144 | |
| 114 | 11 | < 0.1% |
| 401 | 33 | < 0.1% |
| 594 | 88 | < 0.1% |
| 914 | 17 | < 0.1% |
| 991 | 59 | < 0.1% |
| 1055 | 69 | < 0.1% |
| 1086 | 81 | < 0.1% |
| 1090 | 2 | < 0.1% |
| 1111 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 390 | |
| 41310 | 2 | < 0.1% |
| 34095 | 11 | < 0.1% |
| 27828 | 94 | < 0.1% |
| 25236 | 23 | < 0.1% |
| 25124 | 18 | < 0.1% |
| 22040 | 2 | < 0.1% |
| 20051 | 91 | < 0.1% |
| 18481 | 14 | < 0.1% |
| 15831 | 16 | < 0.1% |
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.31378839 |
| Minimum | 0 |
|---|---|
| Maximum | 4608 |
| Zeros | 195617 |
| Zeros (%) | 98.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4608 |
| Range | 4608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 271.8964284 |
|---|---|
| Coefficient of variation (CV) | 7.286754847 |
| Kurtosis | 61.63293305 |
| Mean | 37.31378839 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.6325647 |
| Sum | 7444959 |
| Variance | 73927.66776 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 195617 | |
| 1902 | 407 | 0.2% |
| 1977 | 381 | 0.2% |
| 1887 | 364 | 0.2% |
| 1602 | 193 | 0.1% |
| 2415 | 122 | 0.1% |
| 1485 | 95 | < 0.1% |
| 1848 | 88 | < 0.1% |
| 1876 | 87 | < 0.1% |
| 1672 | 85 | < 0.1% |
| Other values (103) | 2084 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 195617 | |
| 155 | 1 | < 0.1% |
| 213 | 10 | < 0.1% |
| 323 | 10 | < 0.1% |
| 419 | 29 | < 0.1% |
| 625 | 25 | < 0.1% |
| 653 | 7 | < 0.1% |
| 772 | 5 | < 0.1% |
| 810 | 5 | < 0.1% |
| 880 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 4608 | 4 | < 0.1% |
| 4356 | 30 | |
| 3900 | 2 | < 0.1% |
| 3770 | 5 | < 0.1% |
| 3683 | 4 | < 0.1% |
| 3500 | 10 | < 0.1% |
| 3175 | 8 | < 0.1% |
| 3004 | 11 | < 0.1% |
| 2824 | 27 | |
| 2788 | 7 | < 0.1% |
| Distinct | 1478 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 197.5295329 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 178382 |
| Zeros (%) | 89.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 400 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1984.163658 |
|---|---|
| Coefficient of variation (CV) | 10.04489622 |
| Kurtosis | 1090.563754 |
| Mean | 197.5295329 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.78650179 |
| Sum | 39411685 |
| Variance | 3936905.423 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 178382 | |
| 100 | 1148 | 0.6% |
| 500 | 1030 | 0.5% |
| 1000 | 894 | 0.4% |
| 200 | 866 | 0.4% |
| 50 | 832 | 0.4% |
| 2000 | 574 | 0.3% |
| 250 | 555 | 0.3% |
| 150 | 549 | 0.3% |
| 300 | 523 | 0.3% |
| Other values (1468) | 14170 | 7.1% |
| Value | Count | Frequency (%) |
| 0 | 178382 | |
| 1 | 472 | 0.2% |
| 2 | 193 | 0.1% |
| 3 | 129 | 0.1% |
| 4 | 75 | < 0.1% |
| 5 | 179 | 0.1% |
| 6 | 100 | 0.1% |
| 7 | 93 | < 0.1% |
| 8 | 94 | < 0.1% |
| 9 | 56 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 25 | |
| 95095 | 1 | < 0.1% |
| 75000 | 5 | < 0.1% |
| 70000 | 3 | < 0.1% |
| 66621 | 2 | < 0.1% |
| 60000 | 7 | < 0.1% |
| 57678 | 1 | < 0.1% |
| 55000 | 1 | < 0.1% |
| 54600 | 2 | < 0.1% |
| 54500 | 2 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Nonfiler | |
|---|---|
| Joint both under 65 | |
| Single | |
| Joint both 65+ | |
| Head of household | 7426 |
Length
| Max length | 29 |
|---|---|
| Median length | 9 |
| Mean length | 13.31297144 |
| Min length | 7 |
Characters and Unicode
| Total characters | 2656244 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nonfiler |
|---|---|
| 2nd row | Head of household |
| 3rd row | Nonfiler |
| 4th row | Nonfiler |
| 5th row | Nonfiler |
Common Values
| Value | Count | Frequency (%) |
| Nonfiler | 75094 | |
| Joint both under 65 | 67383 | |
| Single | 37421 | |
| Joint both 65+ | 8332 | 4.2% |
| Head of household | 7426 | 3.7% |
| Joint one under 65 & one 65+ | 3867 | 1.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 65 | 83449 | |
| joint | 79582 | |
| both | 75715 | |
| nonfiler | 75094 | |
| under | 71250 | |
| single | 37421 | |
| one | 7734 | 1.7% |
| head | 7426 | 1.6% |
| of | 7426 | 1.6% |
| household | 7426 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 456390 | ||
| n | 271081 | |
| o | 260403 | 9.8% |
| e | 206351 | 7.8% |
| i | 192097 | 7.2% |
| t | 155297 | 5.8% |
| r | 146344 | 5.5% |
| l | 119941 | 4.5% |
| h | 90567 | 3.4% |
| d | 86102 | 3.2% |
| Other values (14) | 671671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1817367 | |
| Space Separator | 456390 | 17.2% |
| Uppercase Letter | 199523 | 7.5% |
| Decimal Number | 166898 | 6.3% |
| Math Symbol | 12199 | 0.5% |
| Other Punctuation | 3867 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 271081 | |
| o | 260403 | |
| e | 206351 | |
| i | 192097 | |
| t | 155297 | |
| r | 146344 | |
| l | 119941 | |
| h | 90567 | 5.0% |
| d | 86102 | 4.7% |
| f | 82520 | 4.5% |
| Other values (5) | 206664 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 79582 | |
| N | 75094 | |
| S | 37421 | |
| H | 7426 | 3.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 83449 | |
| 5 | 83449 |
Space Separator
| Value | Count | Frequency (%) |
| 456390 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 12199 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 3867 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2016890 | |
| Common | 639354 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 271081 | |
| o | 260403 | |
| e | 206351 | |
| i | 192097 | |
| t | 155297 | 7.7% |
| r | 146344 | 7.3% |
| l | 119941 | 5.9% |
| h | 90567 | 4.5% |
| d | 86102 | 4.3% |
| f | 82520 | 4.1% |
| Other values (9) | 406187 |
Common
| Value | Count | Frequency (%) |
| 456390 | ||
| 6 | 83449 | 13.1% |
| 5 | 83449 | 13.1% |
| + | 12199 | 1.9% |
| & | 3867 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2656244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 456390 | ||
| n | 271081 | |
| o | 260403 | 9.8% |
| e | 206351 | 7.8% |
| i | 192097 | 7.2% |
| t | 155297 | 5.8% |
| r | 146344 | 5.5% |
| l | 119941 | 4.5% |
| h | 90567 | 3.4% |
| d | 86102 | 3.2% |
| Other values (14) | 671671 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| South | 4889 |
| West | 4074 |
| Midwest | 3575 |
| Northeast | 2705 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.28176701 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3049064 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | South |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 183750 | |
| South | 4889 | 2.5% |
| West | 4074 | 2.0% |
| Midwest | 3575 | 1.8% |
| Northeast | 2705 | 1.4% |
| Abroad | 530 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 183750 | |
| in | 183750 | |
| universe | 183750 | |
| south | 4889 | 0.9% |
| west | 4074 | 0.7% |
| midwest | 3575 | 0.6% |
| northeast | 2705 | 0.5% |
| abroad | 530 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 567023 | ||
| e | 377854 | |
| i | 371075 | |
| n | 367500 | |
| t | 201698 | 6.6% |
| s | 194104 | 6.4% |
| o | 191874 | 6.3% |
| u | 188639 | 6.2% |
| r | 186985 | 6.1% |
| N | 186455 | 6.1% |
| Other values (10) | 215857 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2282518 | |
| Space Separator | 567023 | 18.6% |
| Uppercase Letter | 199523 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 377854 | |
| i | 371075 | |
| n | 367500 | |
| t | 201698 | |
| s | 194104 | |
| o | 191874 | |
| u | 188639 | |
| r | 186985 | |
| v | 183750 | |
| h | 7594 | 0.3% |
| Other values (4) | 11445 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 186455 | |
| S | 4889 | 2.5% |
| W | 4074 | 2.0% |
| M | 3575 | 1.8% |
| A | 530 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 567023 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2482041 | |
| Common | 567023 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 377854 | |
| i | 371075 | |
| n | 367500 | |
| t | 201698 | |
| s | 194104 | |
| o | 191874 | |
| u | 188639 | |
| r | 186985 | |
| N | 186455 | |
| v | 183750 | |
| Other values (9) | 32107 | 1.3% |
Common
| Value | Count | Frequency (%) |
| 567023 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3049064 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 567023 | ||
| e | 377854 | |
| i | 371075 | |
| n | 367500 | |
| t | 201698 | 6.6% |
| s | 194104 | 6.4% |
| o | 191874 | 6.3% |
| u | 188639 | 6.2% |
| r | 186985 | 6.1% |
| N | 186455 | 6.1% |
| Other values (10) | 215857 | 7.1% |
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| California | 1714 |
| Utah | 1063 |
| Florida | 849 |
| North Carolina | 812 |
| Other values (46) | 11335 |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 15.45687465 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3084002 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Arkansas |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 183750 | |
| California | 1714 | 0.9% |
| Utah | 1063 | 0.5% |
| Florida | 849 | 0.4% |
| North Carolina | 812 | 0.4% |
| ? | 708 | 0.4% |
| Abroad | 671 | 0.3% |
| Oklahoma | 626 | 0.3% |
| Minnesota | 576 | 0.3% |
| Indiana | 533 | 0.3% |
| Other values (41) | 8221 | 4.1% |
Length
| Value | Count | Frequency (%) |
| not | 183750 | |
| in | 183750 | |
| universe | 183750 | |
| california | 1714 | 0.3% |
| north | 1311 | 0.2% |
| utah | 1063 | 0.2% |
| new | 975 | 0.2% |
| carolina | 907 | 0.2% |
| florida | 849 | 0.1% |
| 708 | 0.1% | |
| Other values (46) | 11228 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 570005 | ||
| i | 380324 | |
| n | 377218 | |
| e | 373184 | |
| o | 195445 | 6.3% |
| r | 192090 | 6.2% |
| s | 189330 | 6.1% |
| t | 189230 | 6.1% |
| N | 186388 | 6.0% |
| u | 184978 | 6.0% |
| Other values (36) | 245810 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2311608 | |
| Space Separator | 570005 | 18.5% |
| Uppercase Letter | 201681 | 6.5% |
| Other Punctuation | 708 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 380324 | |
| n | 377218 | |
| e | 373184 | |
| o | 195445 | |
| r | 192090 | |
| s | 189330 | |
| t | 189230 | |
| u | 184978 | |
| v | 184123 | |
| a | 19048 | 0.8% |
| Other values (14) | 26638 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 186388 | |
| C | 3093 | 1.5% |
| M | 2539 | 1.3% |
| A | 1625 | 0.8% |
| O | 1073 | 0.5% |
| U | 1063 | 0.5% |
| I | 933 | 0.5% |
| F | 849 | 0.4% |
| D | 826 | 0.4% |
| W | 577 | 0.3% |
| Other values (10) | 2715 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 570005 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 708 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2513289 | |
| Common | 570713 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 380324 | |
| n | 377218 | |
| e | 373184 | |
| o | 195445 | |
| r | 192090 | |
| s | 189330 | |
| t | 189230 | |
| N | 186388 | |
| u | 184978 | |
| v | 184123 | |
| Other values (34) | 60979 | 2.4% |
Common
| Value | Count | Frequency (%) |
| 570005 | ||
| ? | 708 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3084002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 570005 | ||
| i | 380324 | |
| n | 377218 | |
| e | 373184 | |
| o | 195445 | 6.3% |
| r | 192090 | 6.2% |
| s | 189330 | 6.1% |
| t | 189230 | 6.1% |
| N | 186388 | 6.0% |
| u | 184978 | 6.0% |
| Other values (36) | 245810 |
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Householder | |
|---|---|
| Child <18 never marr not in subfamily | |
| Spouse of householder | |
| Nonfamily householder | |
| Child 18+ never marr Not in a subfamily | |
| Other values (33) |
Length
| Max length | 48 |
|---|---|
| Median length | 22 |
| Mean length | 25.71388762 |
| Min length | 12 |
Characters and Unicode
| Total characters | 5130512 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Other Rel 18+ ever marr not in subfamily |
|---|---|
| 2nd row | Householder |
| 3rd row | Child 18+ never marr Not in a subfamily |
| 4th row | Child <18 never marr not in subfamily |
| 5th row | Child <18 never marr not in subfamily |
Common Values
| Value | Count | Frequency (%) |
| Householder | 53248 | |
| Child <18 never marr not in subfamily | 50326 | |
| Spouse of householder | 41695 | |
| Nonfamily householder | 22213 | |
| Child 18+ never marr Not in a subfamily | 12030 | 6.0% |
| Secondary individual | 6122 | 3.1% |
| Other Rel 18+ ever marr not in subfamily | 1956 | 1.0% |
| Grandchild <18 never marr child of subfamily RP | 1868 | 0.9% |
| Other Rel 18+ never marr not in subfamily | 1728 | 0.9% |
| Grandchild <18 never marr not in subfamily | 1066 | 0.5% |
| Other values (28) | 7271 | 3.6% |
Length
| Value | Count | Frequency (%) |
| householder | 117156 | |
| subfamily | 76049 | |
| 18 | 75312 | |
| marr | 73797 | |
| never | 69408 | |
| in | 69347 | |
| not | 69151 | |
| child | 68138 | |
| of | 49377 | |
| spouse | 42526 | 5.4% |
| Other values (15) | 77412 |
Most occurring characters
| Value | Count | Frequency (%) |
| 787673 | ||
| e | 446352 | 8.7% |
| o | 423897 | 8.3% |
| r | 357168 | 7.0% |
| l | 300845 | 5.9% |
| h | 258900 | 5.0% |
| i | 257293 | 5.0% |
| u | 244446 | 4.8% |
| s | 236706 | 4.6% |
| n | 234893 | 4.6% |
| Other values (25) | 1582339 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3885632 | |
| Space Separator | 787673 | 15.4% |
| Uppercase Letter | 232003 | 4.5% |
| Decimal Number | 150624 | 2.9% |
| Math Symbol | 74580 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 446352 | |
| o | 423897 | |
| r | 357168 | 9.2% |
| l | 300845 | 7.7% |
| h | 258900 | 6.7% |
| i | 257293 | 6.6% |
| u | 244446 | 6.3% |
| s | 236706 | 6.1% |
| n | 234893 | 6.0% |
| d | 211877 | 5.5% |
| Other values (11) | 913255 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 65614 | |
| H | 53248 | |
| S | 47869 | |
| N | 35256 | |
| R | 13224 | 5.7% |
| P | 6898 | 3.0% |
| O | 6326 | 2.7% |
| G | 3372 | 1.5% |
| I | 196 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 75312 | |
| 8 | 75312 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 54645 | |
| + | 19935 | 26.7% |
Space Separator
| Value | Count | Frequency (%) |
| 787673 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4117635 | |
| Common | 1012877 | 19.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 446352 | 10.8% |
| o | 423897 | 10.3% |
| r | 357168 | 8.7% |
| l | 300845 | 7.3% |
| h | 258900 | 6.3% |
| i | 257293 | 6.2% |
| u | 244446 | 5.9% |
| s | 236706 | 5.7% |
| n | 234893 | 5.7% |
| d | 211877 | 5.1% |
| Other values (20) | 1145258 |
Common
| Value | Count | Frequency (%) |
| 787673 | ||
| 1 | 75312 | 7.4% |
| 8 | 75312 | 7.4% |
| < | 54645 | 5.4% |
| + | 19935 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5130512 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 787673 | ||
| e | 446352 | 8.7% |
| o | 423897 | 8.3% |
| r | 357168 | 7.0% |
| l | 300845 | 5.9% |
| h | 258900 | 5.0% |
| i | 257293 | 5.0% |
| u | 244446 | 4.8% |
| s | 236706 | 4.6% |
| n | 234893 | 4.6% |
| Other values (25) | 1582339 |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Householder | |
|---|---|
| Child under 18 never married | |
| Spouse of householder | |
| Child 18 or older | |
| Other relative of householder | |
| Other values (3) |
Length
| Max length | 37 |
|---|---|
| Median length | 22 |
| Mean length | 20.28793172 |
| Min length | 12 |
Characters and Unicode
| Total characters | 4047909 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other relative of householder |
|---|---|
| 2nd row | Householder |
| 3rd row | Child 18 or older |
| 4th row | Child under 18 never married |
| 5th row | Child under 18 never married |
Common Values
| Value | Count | Frequency (%) |
| Householder | 75475 | |
| Child under 18 never married | 50426 | |
| Spouse of householder | 41709 | |
| Child 18 or older | 14430 | 7.2% |
| Other relative of householder | 9703 | 4.9% |
| Nonrelative of householder | 7601 | 3.8% |
| Group Quarters- Secondary individual | 132 | 0.1% |
| Child under 18 ever married | 47 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| householder | 134488 | |
| child | 64903 | |
| 18 | 64903 | |
| of | 59013 | |
| married | 50473 | 8.8% |
| under | 50473 | 8.8% |
| never | 50426 | 8.8% |
| spouse | 41709 | 7.3% |
| or | 14430 | 2.5% |
| older | 14430 | 2.5% |
| Other values (8) | 27582 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 572830 | ||
| e | 571582 | |
| o | 406423 | |
| r | 392775 | |
| d | 315163 | |
| h | 268107 | 6.6% |
| l | 231257 | 5.7% |
| u | 227066 | 5.6% |
| s | 176329 | 4.4% |
| i | 133076 | 3.3% |
| Other values (19) | 753301 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3145354 | |
| Space Separator | 572830 | 14.2% |
| Uppercase Letter | 199787 | 4.9% |
| Decimal Number | 129806 | 3.2% |
| Dash Punctuation | 132 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 571582 | |
| o | 406423 | |
| r | 392775 | |
| d | 315163 | |
| h | 268107 | |
| l | 231257 | |
| u | 227066 | 7.2% |
| s | 176329 | 5.6% |
| i | 133076 | 4.2% |
| n | 108764 | 3.5% |
| Other values (8) | 314812 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 75475 | |
| C | 64903 | |
| S | 41841 | |
| O | 9703 | 4.9% |
| N | 7601 | 3.8% |
| G | 132 | 0.1% |
| Q | 132 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 64903 | |
| 8 | 64903 |
Space Separator
| Value | Count | Frequency (%) |
| 572830 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 132 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3345141 | |
| Common | 702768 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 571582 | |
| o | 406423 | |
| r | 392775 | |
| d | 315163 | |
| h | 268107 | |
| l | 231257 | |
| u | 227066 | 6.8% |
| s | 176329 | 5.3% |
| i | 133076 | 4.0% |
| n | 108764 | 3.3% |
| Other values (15) | 514599 |
Common
| Value | Count | Frequency (%) |
| 572830 | ||
| 1 | 64903 | 9.2% |
| 8 | 64903 | 9.2% |
| - | 132 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4047909 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 572830 | ||
| e | 571582 | |
| o | 406423 | |
| r | 392775 | |
| d | 315163 | |
| h | 268107 | 6.6% |
| l | 231257 | 5.7% |
| u | 227066 | 5.6% |
| s | 176329 | 4.4% |
| i | 133076 | 3.3% |
| Other values (19) | 753301 |
| Distinct | 99800 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1740.380269 |
| Minimum | 37.87 |
|---|---|
| Maximum | 18656.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 37.87 |
|---|---|
| 5-th percentile | 395.342 |
| Q1 | 1061.615 |
| median | 1618.31 |
| Q3 | 2188.61 |
| 95-th percentile | 3585.909 |
| Maximum | 18656.3 |
| Range | 18618.43 |
| Interquartile range (IQR) | 1126.995 |
Descriptive statistics
| Standard deviation | 993.7681558 |
|---|---|
| Coefficient of variation (CV) | 0.5710063331 |
| Kurtosis | 5.412514036 |
| Mean | 1740.380269 |
| Median Absolute Deviation (MAD) | 561.46 |
| Skewness | 1.432733152 |
| Sum | 347245892.5 |
| Variance | 987575.1475 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1601.4 | 32 | < 0.1% |
| 753.23 | 32 | < 0.1% |
| 1191.21 | 32 | < 0.1% |
| 1787.34 | 32 | < 0.1% |
| 1317.51 | 31 | < 0.1% |
| 707.9 | 31 | < 0.1% |
| 1070.15 | 30 | < 0.1% |
| 1009.39 | 28 | < 0.1% |
| 1002.02 | 28 | < 0.1% |
| 1839.19 | 28 | < 0.1% |
| Other values (99790) | 199219 |
| Value | Count | Frequency (%) |
| 37.87 | 1 | < 0.1% |
| 39.11 | 1 | < 0.1% |
| 40.67 | 2 | < 0.1% |
| 42.82 | 2 | < 0.1% |
| 43.26 | 3 | |
| 45.74 | 2 | < 0.1% |
| 47.83 | 6 | |
| 49.82 | 2 | < 0.1% |
| 52.43 | 1 | < 0.1% |
| 52.46 | 4 |
| Value | Count | Frequency (%) |
| 18656.3 | 1 | |
| 16349.2 | 1 | |
| 13911.5 | 1 | |
| 13145.1 | 1 | |
| 13114.2 | 1 | |
| 12960.2 | 1 | |
| 12399.9 | 1 | |
| 12184.5 | 1 | |
| 11958.4 | 1 | |
| 11863 | 1 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| MSA to MSA | |
| NonMSA to nonMSA | 2811 |
| Not in universe | 1516 |
| Other values (5) | 2361 |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 5.841186229 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1165451 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | MSA to MSA |
| 3rd row | ? |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| ? | 99696 | |
| Nonmover | 82538 | |
| MSA to MSA | 10601 | 5.3% |
| NonMSA to nonMSA | 2811 | 1.4% |
| Not in universe | 1516 | 0.8% |
| MSA to nonMSA | 790 | 0.4% |
| NonMSA to MSA | 615 | 0.3% |
| Abroad to MSA | 453 | 0.2% |
| Not identifiable | 430 | 0.2% |
| Abroad to nonMSA | 73 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 99696 | ||
| nonmover | 82538 | |
| msa | 23060 | 9.9% |
| to | 15343 | 6.6% |
| nonmsa | 7100 | 3.0% |
| not | 1946 | 0.8% |
| in | 1516 | 0.6% |
| universe | 1516 | 0.6% |
| abroad | 526 | 0.2% |
| identifiable | 430 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 233671 | ||
| o | 189991 | |
| ? | 99696 | |
| n | 96774 | |
| N | 87910 | 7.5% |
| e | 86430 | 7.4% |
| r | 84580 | 7.3% |
| v | 84054 | 7.2% |
| m | 82538 | 7.1% |
| A | 30686 | 2.6% |
| Other values (11) | 89121 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 653168 | |
| Space Separator | 233671 | 20.0% |
| Uppercase Letter | 178916 | 15.4% |
| Other Punctuation | 99696 | 8.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 189991 | |
| n | 96774 | |
| e | 86430 | |
| r | 84580 | |
| v | 84054 | |
| m | 82538 | |
| t | 17719 | 2.7% |
| i | 4322 | 0.7% |
| u | 1516 | 0.2% |
| s | 1516 | 0.2% |
| Other values (5) | 3728 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 87910 | |
| A | 30686 | 17.2% |
| M | 30160 | 16.9% |
| S | 30160 | 16.9% |
Space Separator
| Value | Count | Frequency (%) |
| 233671 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99696 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 832084 | |
| Common | 333367 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 189991 | |
| n | 96774 | |
| N | 87910 | |
| e | 86430 | |
| r | 84580 | |
| v | 84054 | |
| m | 82538 | |
| A | 30686 | 3.7% |
| M | 30160 | 3.6% |
| S | 30160 | 3.6% |
| Other values (9) | 28801 | 3.5% |
Common
| Value | Count | Frequency (%) |
| 233671 | ||
| ? | 99696 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1165451 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 233671 | ||
| o | 189991 | |
| ? | 99696 | |
| n | 96774 | |
| N | 87910 | 7.5% |
| e | 86430 | 7.4% |
| r | 84580 | 7.3% |
| v | 84054 | 7.2% |
| m | 82538 | 7.1% |
| A | 30686 | 2.6% |
| Other values (11) | 89121 | 7.6% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9812 |
| Different county same state | 2797 |
| Not in universe | 1516 |
| Other values (4) | 3164 |
Length
| Max length | 31 |
|---|---|
| Median length | 7 |
| Mean length | 6.166862968 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1230431 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | Same county |
| 3rd row | ? |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| ? | 99696 | |
| Nonmover | 82538 | |
| Same county | 9812 | 4.9% |
| Different county same state | 2797 | 1.4% |
| Not in universe | 1516 | 0.8% |
| Different region | 1178 | 0.6% |
| Different state same division | 991 | 0.5% |
| Abroad | 530 | 0.3% |
| Different division same region | 465 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 99696 | ||
| nonmover | 82538 | |
| same | 14065 | 6.2% |
| county | 12609 | 5.6% |
| different | 5431 | 2.4% |
| state | 3788 | 1.7% |
| region | 1643 | 0.7% |
| not | 1516 | 0.7% |
| in | 1516 | 0.7% |
| universe | 1516 | 0.7% |
| Other values (2) | 1986 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 226304 | ||
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| ? | 99696 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | 6.9% |
| N | 84054 | 6.8% |
| t | 27132 | 2.2% |
| Other values (13) | 114007 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 804604 | |
| Space Separator | 226304 | 18.4% |
| Uppercase Letter | 99827 | 8.1% |
| Other Punctuation | 99696 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | |
| t | 27132 | 3.4% |
| a | 18383 | 2.3% |
| i | 14474 | 1.8% |
| u | 14125 | 1.8% |
| Other values (7) | 51252 | 6.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 84054 | |
| S | 9812 | 9.8% |
| D | 5431 | 5.4% |
| A | 530 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 226304 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99696 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 904431 | |
| Common | 326000 | 26.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | |
| N | 84054 | |
| t | 27132 | 3.0% |
| a | 18383 | 2.0% |
| i | 14474 | 1.6% |
| Other values (11) | 81150 |
Common
| Value | Count | Frequency (%) |
| 226304 | ||
| ? | 99696 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1230431 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 226304 | ||
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| ? | 99696 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | 6.9% |
| N | 84054 | 6.8% |
| t | 27132 | 2.2% |
| Other values (13) | 114007 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9812 |
| Different county same state | 2797 |
| Not in universe | 1516 |
| Other values (5) | 3164 |
Length
| Max length | 29 |
|---|---|
| Median length | 7 |
| Mean length | 6.186038702 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1234257 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | Same county |
| 3rd row | ? |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| ? | 99696 | |
| Nonmover | 82538 | |
| Same county | 9812 | 4.9% |
| Different county same state | 2797 | 1.4% |
| Not in universe | 1516 | 0.8% |
| Different state in South | 973 | 0.5% |
| Different state in West | 679 | 0.3% |
| Different state in Midwest | 551 | 0.3% |
| Abroad | 530 | 0.3% |
| Different state in Northeast | 431 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 99696 | ||
| nonmover | 82538 | |
| same | 12609 | 5.5% |
| county | 12609 | 5.5% |
| different | 5431 | 2.4% |
| state | 5431 | 2.4% |
| in | 4150 | 1.8% |
| not | 1516 | 0.7% |
| universe | 1516 | 0.7% |
| south | 973 | 0.4% |
| Other values (4) | 2191 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 228660 | ||
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| ? | 99696 | |
| m | 95147 | |
| r | 90446 | 7.3% |
| N | 84485 | 6.8% |
| v | 84054 | 6.8% |
| t | 33483 | 2.7% |
| Other values (16) | 114774 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 803440 | |
| Space Separator | 228660 | 18.5% |
| Uppercase Letter | 102461 | 8.3% |
| Other Punctuation | 99696 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| m | 95147 | |
| r | 90446 | |
| v | 84054 | |
| t | 33483 | 4.2% |
| a | 19001 | 2.4% |
| u | 15098 | 1.9% |
| c | 12609 | 1.6% |
| Other values (8) | 50090 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 84485 | |
| S | 10785 | 10.5% |
| D | 5431 | 5.3% |
| W | 679 | 0.7% |
| M | 551 | 0.5% |
| A | 530 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 228660 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99696 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 905901 | |
| Common | 328356 | 26.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| m | 95147 | |
| r | 90446 | |
| N | 84485 | |
| v | 84054 | |
| t | 33483 | 3.7% |
| a | 19001 | 2.1% |
| u | 15098 | 1.7% |
| Other values (14) | 80675 |
Common
| Value | Count | Frequency (%) |
| 228660 | ||
| ? | 99696 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1234257 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 228660 | ||
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| ? | 99696 | |
| m | 95147 | |
| r | 90446 | 7.3% |
| N | 84485 | 6.8% |
| v | 84054 | 6.8% |
| t | 33483 | 2.7% |
| Other values (16) | 114774 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe under 1 year old | |
|---|---|
| Yes | |
| No |
Length
| Max length | 33 |
|---|---|
| Median length | 33 |
| Mean length | 18.63177178 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3717467 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe under 1 year old |
|---|---|
| 2nd row | No |
| 3rd row | Not in universe under 1 year old |
| 4th row | Yes |
| 5th row | Yes |
Common Values
| Value | Count | Frequency (%) |
| Not in universe under 1 year old | 101212 | |
| Yes | 82538 | |
| No | 15773 | 7.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 101212 | |
| in | 101212 | |
| universe | 101212 | |
| under | 101212 | |
| 1 | 101212 | |
| year | 101212 | |
| old | 101212 | |
| yes | 82538 | |
| no | 15773 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 806795 | ||
| e | 487386 | |
| n | 303636 | 8.2% |
| r | 303636 | 8.2% |
| o | 218197 | 5.9% |
| i | 202424 | 5.4% |
| u | 202424 | 5.4% |
| d | 202424 | 5.4% |
| s | 183750 | 4.9% |
| N | 116985 | 3.1% |
| Other values (7) | 689810 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2609937 | |
| Space Separator | 806795 | 21.7% |
| Uppercase Letter | 199523 | 5.4% |
| Decimal Number | 101212 | 2.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 487386 | |
| n | 303636 | |
| r | 303636 | |
| o | 218197 | |
| i | 202424 | |
| u | 202424 | |
| d | 202424 | |
| s | 183750 | 7.0% |
| t | 101212 | 3.9% |
| v | 101212 | 3.9% |
| Other values (3) | 303636 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 116985 | |
| Y | 82538 |
Space Separator
| Value | Count | Frequency (%) |
| 806795 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 101212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2809460 | |
| Common | 908007 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 487386 | |
| n | 303636 | |
| r | 303636 | |
| o | 218197 | |
| i | 202424 | |
| u | 202424 | |
| d | 202424 | |
| s | 183750 | 6.5% |
| N | 116985 | 4.2% |
| t | 101212 | 3.6% |
| Other values (5) | 487386 |
Common
| Value | Count | Frequency (%) |
| 806795 | ||
| 1 | 101212 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3717467 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 806795 | ||
| e | 487386 | |
| n | 303636 | 8.2% |
| r | 303636 | 8.2% |
| o | 218197 | 5.9% |
| i | 202424 | 5.4% |
| u | 202424 | 5.4% |
| d | 202424 | 5.4% |
| s | 183750 | 4.9% |
| N | 116985 | 3.1% |
| Other values (7) | 689810 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Not in universe | |
| No | |
| Yes | 5786 |
Length
| Max length | 16 |
|---|---|
| Median length | 3 |
| Mean length | 8.005899069 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1597361 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | Yes |
| 3rd row | ? |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| ? | 99696 | |
| Not in universe | 84054 | |
| No | 9987 | 5.0% |
| Yes | 5786 | 2.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 99696 | ||
| not | 84054 | |
| in | 84054 | |
| universe | 84054 | |
| no | 9987 | 2.7% |
| yes | 5786 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 367631 | ||
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| ? | 99696 | 6.2% |
| N | 94041 | 5.9% |
| o | 94041 | 5.9% |
| s | 89840 | 5.6% |
| t | 84054 | 5.3% |
| u | 84054 | 5.3% |
| Other values (3) | 173894 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1030207 | |
| Space Separator | 367631 | 23.0% |
| Uppercase Letter | 99827 | 6.2% |
| Other Punctuation | 99696 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| o | 94041 | |
| s | 89840 | |
| t | 84054 | |
| u | 84054 | |
| v | 84054 | |
| r | 84054 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 94041 | |
| Y | 5786 | 5.8% |
Space Separator
| Value | Count | Frequency (%) |
| 367631 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99696 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1130034 | |
| Common | 467327 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| N | 94041 | |
| o | 94041 | |
| s | 89840 | |
| t | 84054 | |
| u | 84054 | |
| v | 84054 | |
| r | 84054 |
Common
| Value | Count | Frequency (%) |
| 367631 | ||
| ? | 99696 | 21.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1597361 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 367631 | ||
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| ? | 99696 | 6.2% |
| N | 94041 | 5.9% |
| o | 94041 | 5.9% |
| s | 89840 | 5.6% |
| t | 84054 | 5.3% |
| u | 84054 | 5.3% |
| Other values (3) | 173894 |
num_persons_worked_for_employer
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.95618049 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 95983 |
| Zeros (%) | 48.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.365125505 |
|---|---|
| Coefficient of variation (CV) | 1.209052803 |
| Kurtosis | -1.082246833 |
| Mean | 1.95618049 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7515606804 |
| Sum | 390303 |
| Variance | 5.593818657 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95983 | |
| 6 | 36511 | 18.3% |
| 1 | 23109 | 11.6% |
| 4 | 14379 | 7.2% |
| 3 | 13425 | 6.7% |
| 2 | 10081 | 5.1% |
| 5 | 6035 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 95983 | |
| 1 | 23109 | 11.6% |
| 2 | 10081 | 5.1% |
| 3 | 13425 | 6.7% |
| 4 | 14379 | 7.2% |
| 5 | 6035 | 3.0% |
| 6 | 36511 | 18.3% |
| Value | Count | Frequency (%) |
| 6 | 36511 | 18.3% |
| 5 | 6035 | 3.0% |
| 4 | 14379 | 7.2% |
| 3 | 13425 | 6.7% |
| 2 | 10081 | 5.1% |
| 1 | 23109 | 11.6% |
| 0 | 95983 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Both parents present | |
| Mother only present | 12772 |
| Father only present | 1883 |
| Neither parent present | 1653 |
Length
| Max length | 23 |
|---|---|
| Median length | 16 |
| Mean length | 17.32869895 |
| Min length | 16 |
Characters and Unicode
| Total characters | 3457474 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Both parents present |
| 5th row | Both parents present |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 144232 | |
| Both parents present | 38983 | 19.5% |
| Mother only present | 12772 | 6.4% |
| Father only present | 1883 | 0.9% |
| Neither parent present | 1653 | 0.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 144232 | |
| in | 144232 | |
| universe | 144232 | |
| present | 55291 | 9.2% |
| both | 38983 | 6.5% |
| parents | 38983 | 6.5% |
| only | 14655 | 2.4% |
| mother | 12772 | 2.1% |
| father | 1883 | 0.3% |
| neither | 1653 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 598569 | ||
| e | 457643 | |
| n | 399046 | |
| t | 295450 | |
| i | 290117 | |
| r | 256467 | |
| s | 238506 | 6.9% |
| o | 210642 | 6.1% |
| N | 145885 | 4.2% |
| u | 144232 | 4.2% |
| Other values (9) | 420917 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2659382 | |
| Space Separator | 598569 | 17.3% |
| Uppercase Letter | 199523 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 457643 | |
| n | 399046 | |
| t | 295450 | |
| i | 290117 | |
| r | 256467 | |
| s | 238506 | |
| o | 210642 | |
| u | 144232 | 5.4% |
| v | 144232 | 5.4% |
| p | 95927 | 3.6% |
| Other values (4) | 127120 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 145885 | |
| B | 38983 | 19.5% |
| M | 12772 | 6.4% |
| F | 1883 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 598569 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2858905 | |
| Common | 598569 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 457643 | |
| n | 399046 | |
| t | 295450 | |
| i | 290117 | |
| r | 256467 | |
| s | 238506 | |
| o | 210642 | |
| N | 145885 | 5.1% |
| u | 144232 | 5.0% |
| v | 144232 | 5.0% |
| Other values (8) | 276685 |
Common
| Value | Count | Frequency (%) |
| 598569 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3457474 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 598569 | ||
| e | 457643 | |
| n | 399046 | |
| t | 295450 | |
| i | 290117 | |
| r | 256467 | |
| s | 238506 | 6.9% |
| o | 210642 | 6.1% |
| N | 145885 | 4.2% |
| u | 144232 | 4.2% |
| Other values (9) | 420917 |
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| United-States | |
|---|---|
| Mexico | 10008 |
| ? | 6713 |
| Puerto-Rico | 2680 |
| Italy | 2212 |
| Other values (38) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 12.66875999 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2527709 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | Vietnam |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 159163 | |
| Mexico | 10008 | 5.0% |
| ? | 6713 | 3.4% |
| Puerto-Rico | 2680 | 1.3% |
| Italy | 2212 | 1.1% |
| Canada | 1380 | 0.7% |
| Germany | 1356 | 0.7% |
| Dominican-Republic | 1290 | 0.6% |
| Poland | 1212 | 0.6% |
| Philippines | 1154 | 0.6% |
| Other values (33) | 12355 | 6.2% |
Length
| Value | Count | Frequency (%) |
| united-states | 159163 | |
| mexico | 10008 | 5.0% |
| 6713 | 3.3% | |
| puerto-rico | 2680 | 1.3% |
| italy | 2212 | 1.1% |
| canada | 1380 | 0.7% |
| germany | 1356 | 0.7% |
| dominican-republic | 1290 | 0.6% |
| poland | 1212 | 0.6% |
| philippines | 1154 | 0.6% |
| Other values (39) | 13627 | 6.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 485168 | |
| e | 338573 | |
| 200795 | ||
| a | 185809 | 7.4% |
| i | 184161 | 7.3% |
| n | 173312 | 6.9% |
| d | 166069 | 6.6% |
| - | 164325 | 6.5% |
| S | 161240 | 6.4% |
| s | 160933 | 6.4% |
| Other values (37) | 307324 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1796607 | |
| Uppercase Letter | 358838 | 14.2% |
| Space Separator | 200795 | 7.9% |
| Dash Punctuation | 164325 | 6.5% |
| Other Punctuation | 6826 | 0.3% |
| Open Punctuation | 159 | < 0.1% |
| Close Punctuation | 159 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 485168 | |
| e | 338573 | |
| a | 185809 | 10.3% |
| i | 184161 | 10.3% |
| n | 173312 | 9.6% |
| d | 166069 | 9.2% |
| s | 160933 | 9.0% |
| o | 22790 | 1.3% |
| c | 17366 | 1.0% |
| l | 11412 | 0.6% |
| Other values (11) | 51014 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 161240 | |
| U | 159481 | |
| M | 10008 | 2.8% |
| P | 5794 | 1.6% |
| C | 4171 | 1.2% |
| R | 3970 | 1.1% |
| I | 3692 | 1.0% |
| G | 2304 | 0.6% |
| E | 2154 | 0.6% |
| D | 1290 | 0.4% |
| Other values (10) | 4734 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 6713 | |
| & | 113 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| 200795 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 164325 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 159 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 159 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2155445 | |
| Common | 372264 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 485168 | |
| e | 338573 | |
| a | 185809 | 8.6% |
| i | 184161 | 8.5% |
| n | 173312 | 8.0% |
| d | 166069 | 7.7% |
| S | 161240 | 7.5% |
| s | 160933 | 7.5% |
| U | 159481 | 7.4% |
| o | 22790 | 1.1% |
| Other values (31) | 117909 | 5.5% |
Common
| Value | Count | Frequency (%) |
| 200795 | ||
| - | 164325 | |
| ? | 6713 | 1.8% |
| ( | 159 | < 0.1% |
| ) | 159 | < 0.1% |
| & | 113 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2527709 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 485168 | |
| e | 338573 | |
| 200795 | ||
| a | 185809 | 7.4% |
| i | 184161 | 7.3% |
| n | 173312 | 6.9% |
| d | 166069 | 6.6% |
| - | 164325 | 6.5% |
| S | 161240 | 6.4% |
| s | 160933 | 6.4% |
| Other values (37) | 307324 |
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| United-States | |
|---|---|
| Mexico | 9781 |
| ? | 6119 |
| Puerto-Rico | 2473 |
| Italy | 1844 |
| Other values (38) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 12.72127023 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2538186 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | Vietnam |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 160479 | |
| Mexico | 9781 | 4.9% |
| ? | 6119 | 3.1% |
| Puerto-Rico | 2473 | 1.2% |
| Italy | 1844 | 0.9% |
| Canada | 1451 | 0.7% |
| Germany | 1382 | 0.7% |
| Philippines | 1231 | 0.6% |
| Poland | 1110 | 0.6% |
| El-Salvador | 1108 | 0.6% |
| Other values (33) | 12545 | 6.3% |
Length
| Value | Count | Frequency (%) |
| united-states | 160479 | |
| mexico | 9781 | 4.9% |
| 6119 | 3.0% | |
| puerto-rico | 2473 | 1.2% |
| italy | 1844 | 0.9% |
| canada | 1451 | 0.7% |
| germany | 1382 | 0.7% |
| philippines | 1231 | 0.6% |
| poland | 1110 | 0.6% |
| el-salvador | 1108 | 0.6% |
| Other values (39) | 13889 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 488579 | |
| e | 340658 | |
| 200867 | ||
| a | 187061 | 7.4% |
| i | 184556 | 7.3% |
| n | 174658 | 6.9% |
| d | 167641 | 6.6% |
| - | 165369 | 6.5% |
| S | 162751 | 6.4% |
| s | 162309 | 6.4% |
| Other values (37) | 303737 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1804888 | |
| Uppercase Letter | 360530 | 14.2% |
| Space Separator | 200867 | 7.9% |
| Dash Punctuation | 165369 | 6.5% |
| Other Punctuation | 6218 | 0.2% |
| Open Punctuation | 157 | < 0.1% |
| Close Punctuation | 157 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 488579 | |
| e | 340658 | |
| a | 187061 | 10.4% |
| i | 184556 | 10.2% |
| n | 174658 | 9.7% |
| d | 167641 | 9.3% |
| s | 162309 | 9.0% |
| o | 22004 | 1.2% |
| c | 16460 | 0.9% |
| l | 11200 | 0.6% |
| Other values (11) | 49762 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 162751 | |
| U | 160793 | |
| M | 9781 | 2.7% |
| P | 5543 | 1.5% |
| C | 4088 | 1.1% |
| R | 3576 | 1.0% |
| I | 3379 | 0.9% |
| E | 2386 | 0.7% |
| G | 2244 | 0.6% |
| D | 1103 | 0.3% |
| Other values (10) | 4886 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 6119 | |
| & | 99 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 200867 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 165369 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 157 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 157 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2165418 | |
| Common | 372768 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 488579 | |
| e | 340658 | |
| a | 187061 | 8.6% |
| i | 184556 | 8.5% |
| n | 174658 | 8.1% |
| d | 167641 | 7.7% |
| S | 162751 | 7.5% |
| s | 162309 | 7.5% |
| U | 160793 | 7.4% |
| o | 22004 | 1.0% |
| Other values (31) | 114408 | 5.3% |
Common
| Value | Count | Frequency (%) |
| 200867 | ||
| - | 165369 | |
| ? | 6119 | 1.6% |
| ( | 157 | < 0.1% |
| ) | 157 | < 0.1% |
| & | 99 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2538186 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 488579 | |
| e | 340658 | |
| 200867 | ||
| a | 187061 | 7.4% |
| i | 184556 | 7.3% |
| n | 174658 | 6.9% |
| d | 167641 | 6.6% |
| - | 165369 | 6.5% |
| S | 162751 | 6.4% |
| s | 162309 | 6.4% |
| Other values (37) | 303737 |
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| United-States | |
|---|---|
| Mexico | 5767 |
| ? | 3393 |
| Puerto-Rico | 1400 |
| Germany | 851 |
| Other values (38) | 11123 |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 13.27975722 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2649617 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | Vietnam |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 176989 | |
| Mexico | 5767 | 2.9% |
| ? | 3393 | 1.7% |
| Puerto-Rico | 1400 | 0.7% |
| Germany | 851 | 0.4% |
| Philippines | 845 | 0.4% |
| Cuba | 837 | 0.4% |
| Canada | 700 | 0.4% |
| Dominican-Republic | 690 | 0.3% |
| El-Salvador | 689 | 0.3% |
| Other values (33) | 7362 | 3.7% |
Length
| Value | Count | Frequency (%) |
| united-states | 176989 | |
| mexico | 5767 | 2.9% |
| 3393 | 1.7% | |
| puerto-rico | 1400 | 0.7% |
| germany | 851 | 0.4% |
| philippines | 845 | 0.4% |
| cuba | 837 | 0.4% |
| canada | 700 | 0.3% |
| dominican-republic | 690 | 0.3% |
| el-salvador | 689 | 0.3% |
| Other values (39) | 8409 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 534730 | |
| e | 365867 | |
| 200570 | 7.6% | |
| a | 192481 | 7.3% |
| i | 192126 | 7.3% |
| n | 185160 | 7.0% |
| d | 180622 | 6.8% |
| - | 179910 | 6.8% |
| S | 178462 | 6.7% |
| s | 178172 | 6.7% |
| Other values (37) | 261517 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1888049 | |
| Uppercase Letter | 377391 | 14.2% |
| Space Separator | 200570 | 7.6% |
| Dash Punctuation | 179910 | 6.8% |
| Other Punctuation | 3459 | 0.1% |
| Open Punctuation | 119 | < 0.1% |
| Close Punctuation | 119 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 534730 | |
| e | 365867 | |
| a | 192481 | 10.2% |
| i | 192126 | 10.2% |
| n | 185160 | 9.8% |
| d | 180622 | 9.6% |
| s | 178172 | 9.4% |
| o | 12975 | 0.7% |
| c | 9805 | 0.5% |
| x | 5767 | 0.3% |
| Other values (11) | 30344 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 178462 | |
| U | 177227 | |
| M | 5767 | 1.5% |
| P | 3096 | 0.8% |
| C | 2544 | 0.7% |
| R | 2090 | 0.6% |
| G | 1461 | 0.4% |
| E | 1404 | 0.4% |
| I | 1238 | 0.3% |
| D | 690 | 0.2% |
| Other values (10) | 3412 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3393 | |
| & | 66 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 200570 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 179910 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 119 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 119 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2265440 | |
| Common | 384177 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 534730 | |
| e | 365867 | |
| a | 192481 | 8.5% |
| i | 192126 | 8.5% |
| n | 185160 | 8.2% |
| d | 180622 | 8.0% |
| S | 178462 | 7.9% |
| s | 178172 | 7.9% |
| U | 177227 | 7.8% |
| o | 12975 | 0.6% |
| Other values (31) | 67618 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 200570 | ||
| - | 179910 | |
| ? | 3393 | 0.9% |
| ( | 119 | < 0.1% |
| ) | 119 | < 0.1% |
| & | 66 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2649617 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 534730 | |
| e | 365867 | |
| 200570 | 7.6% | |
| a | 192481 | 7.3% |
| i | 192126 | 7.3% |
| n | 185160 | 7.0% |
| d | 180622 | 6.8% |
| - | 179910 | 6.8% |
| S | 178462 | 6.7% |
| s | 178172 | 6.7% |
| Other values (37) | 261517 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Native- Born in the United States | |
|---|---|
| Foreign born- Not a citizen of U S | 13401 |
| Foreign born- U S citizen by naturalization | 5855 |
| Native- Born abroad of American Parent(s) | 1756 |
| Native- Born in Puerto Rico or U S Outlying | 1519 |
Length
| Max length | 44 |
|---|---|
| Median length | 34 |
| Mean length | 34.57431975 |
| Min length | 34 |
Characters and Unicode
| Total characters | 6898372 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Native- Born in the United States |
|---|---|
| 2nd row | Native- Born in the United States |
| 3rd row | Foreign born- Not a citizen of U S |
| 4th row | Native- Born in the United States |
| 5th row | Native- Born in the United States |
Common Values
| Value | Count | Frequency (%) |
| Native- Born in the United States | 176992 | |
| Foreign born- Not a citizen of U S | 13401 | 6.7% |
| Foreign born- U S citizen by naturalization | 5855 | 2.9% |
| Native- Born abroad of American Parent(s) | 1756 | 0.9% |
| Native- Born in Puerto Rico or U S Outlying | 1519 | 0.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| born | 199523 | |
| native | 180267 | |
| in | 178511 | |
| the | 176992 | |
| united | 176992 | |
| states | 176992 | |
| s | 20775 | 1.7% |
| u | 20775 | 1.7% |
| citizen | 19256 | 1.6% |
| foreign | 19256 | 1.6% |
| Other values (12) | 65013 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1247753 | ||
| t | 937396 | |
| e | 754786 | |
| n | 610279 | |
| i | 610042 | |
| a | 395249 | 5.7% |
| o | 259505 | 3.8% |
| r | 232940 | 3.4% |
| - | 199523 | 2.9% |
| U | 197767 | 2.9% |
| Other values (23) | 1453132 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4650790 | |
| Space Separator | 1247753 | 18.1% |
| Uppercase Letter | 796794 | 11.6% |
| Dash Punctuation | 199523 | 2.9% |
| Open Punctuation | 1756 | < 0.1% |
| Close Punctuation | 1756 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 937396 | |
| e | 754786 | |
| n | 610279 | |
| i | 610042 | |
| a | 395249 | |
| o | 259505 | 5.6% |
| r | 232940 | 5.0% |
| v | 180267 | 3.9% |
| d | 178748 | 3.8% |
| s | 178748 | 3.8% |
| Other values (10) | 312830 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 197767 | |
| S | 197767 | |
| N | 193668 | |
| B | 180267 | |
| F | 19256 | 2.4% |
| P | 3275 | 0.4% |
| A | 1756 | 0.2% |
| R | 1519 | 0.2% |
| O | 1519 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1247753 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 199523 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1756 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1756 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5447584 | |
| Common | 1450788 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 937396 | |
| e | 754786 | |
| n | 610279 | |
| i | 610042 | |
| a | 395249 | 7.3% |
| o | 259505 | 4.8% |
| r | 232940 | 4.3% |
| U | 197767 | 3.6% |
| S | 197767 | 3.6% |
| N | 193668 | 3.6% |
| Other values (19) | 1058185 |
Common
| Value | Count | Frequency (%) |
| 1247753 | ||
| - | 199523 | 13.8% |
| ( | 1756 | 0.1% |
| ) | 1756 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6898372 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1247753 | ||
| t | 937396 | |
| e | 754786 | |
| n | 610279 | |
| i | 610042 | |
| a | 395249 | 5.7% |
| o | 259505 | 3.8% |
| r | 232940 | 3.4% |
| - | 199523 | 2.9% |
| U | 197767 | 2.9% |
| Other values (23) | 1453132 |
own_business_or_self_employed
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 2 | 16153 |
| 1 | 2698 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 399046 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 180672 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 180672 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 199523 | ||
| 0 | 180672 | |
| 2 | 16153 | 4.0% |
| 1 | 2698 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 199523 | |
| Decimal Number | 199523 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 180672 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 199523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 399046 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 199523 | ||
| 0 | 180672 | |
| 2 | 16153 | 4.0% |
| 1 | 2698 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 399046 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 199523 | ||
| 0 | 180672 | |
| 2 | 16153 | 4.0% |
| 1 | 2698 | 0.7% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| No | 1593 |
| Yes | 391 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.87269137 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3166967 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 197539 | |
| No | 1593 | 0.8% |
| Yes | 391 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| not | 197539 | |
| in | 197539 | |
| universe | 197539 | |
| no | 1593 | 0.3% |
| yes | 391 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 594601 | ||
| e | 395469 | |
| i | 395078 | |
| n | 395078 | |
| N | 199132 | 6.3% |
| o | 199132 | 6.3% |
| s | 197930 | 6.2% |
| t | 197539 | 6.2% |
| u | 197539 | 6.2% |
| v | 197539 | 6.2% |
| Other values (2) | 197930 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2372843 | |
| Space Separator | 594601 | 18.8% |
| Uppercase Letter | 199523 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 395469 | |
| i | 395078 | |
| n | 395078 | |
| o | 199132 | |
| s | 197930 | |
| t | 197539 | |
| u | 197539 | |
| v | 197539 | |
| r | 197539 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 199132 | |
| Y | 391 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 594601 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2572366 | |
| Common | 594601 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 395469 | |
| i | 395078 | |
| n | 395078 | |
| N | 199132 | |
| o | 199132 | |
| s | 197930 | |
| t | 197539 | |
| u | 197539 | |
| v | 197539 | |
| r | 197539 |
Common
| Value | Count | Frequency (%) |
| 594601 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3166967 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 594601 | ||
| e | 395469 | |
| i | 395078 | |
| n | 395078 | |
| N | 199132 | 6.3% |
| o | 199132 | 6.3% |
| s | 197930 | 6.2% |
| t | 197539 | 6.2% |
| u | 197539 | 6.2% |
| v | 197539 | 6.2% |
| Other values (2) | 197930 | 6.2% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2 | |
|---|---|
| 0 | |
| 1 | 1984 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 399046 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 150130 | |
| 0 | 47409 | 23.8% |
| 1 | 1984 | 1.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 150130 | |
| 0 | 47409 | 23.8% |
| 1 | 1984 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 199523 | ||
| 2 | 150130 | |
| 0 | 47409 | 11.9% |
| 1 | 1984 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 199523 | |
| Decimal Number | 199523 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 150130 | |
| 0 | 47409 | 23.8% |
| 1 | 1984 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 199523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 399046 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 199523 | ||
| 2 | 150130 | |
| 0 | 47409 | 11.9% |
| 1 | 1984 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 399046 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 199523 | ||
| 2 | 150130 | |
| 0 | 47409 | 11.9% |
| 1 | 1984 | 0.5% |
weeks_worked_in_year
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.17489713 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 95983 |
| Zeros (%) | 48.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 8 |
| Q3 | 52 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 24.41148817 |
|---|---|
| Coefficient of variation (CV) | 1.053359073 |
| Kurtosis | -1.863805826 |
| Mean | 23.17489713 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.2101693419 |
| Sum | 4623925 |
| Variance | 595.9207546 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95983 | |
| 52 | 70314 | |
| 40 | 2790 | 1.4% |
| 50 | 2304 | 1.2% |
| 26 | 2268 | 1.1% |
| 48 | 1806 | 0.9% |
| 12 | 1780 | 0.9% |
| 30 | 1378 | 0.7% |
| 20 | 1330 | 0.7% |
| 8 | 1126 | 0.6% |
| Other values (43) | 18444 | 9.2% |
| Value | Count | Frequency (%) |
| 0 | 95983 | |
| 1 | 464 | 0.2% |
| 2 | 458 | 0.2% |
| 3 | 417 | 0.2% |
| 4 | 757 | 0.4% |
| 5 | 309 | 0.2% |
| 6 | 646 | 0.3% |
| 7 | 152 | 0.1% |
| 8 | 1126 | 0.6% |
| 9 | 239 | 0.1% |
| Value | Count | Frequency (%) |
| 52 | 70314 | |
| 51 | 819 | 0.4% |
| 50 | 2304 | 1.2% |
| 49 | 509 | 0.3% |
| 48 | 1806 | 0.9% |
| 47 | 278 | 0.1% |
| 46 | 708 | 0.4% |
| 45 | 669 | 0.3% |
| 44 | 845 | 0.4% |
| 43 | 374 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 94 | |
|---|---|
| 95 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 598569 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 95 |
|---|---|
| 2nd row | 94 |
| 3rd row | 95 |
| 4th row | 94 |
| 5th row | 94 |
Common Values
| Value | Count | Frequency (%) |
| 94 | 99827 | |
| 95 | 99696 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 94 | 99827 | |
| 95 | 99696 |
Most occurring characters
| Value | Count | Frequency (%) |
| 199523 | ||
| 9 | 199523 | |
| 4 | 99827 | |
| 5 | 99696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 399046 | |
| Space Separator | 199523 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 199523 | |
| 4 | 99827 | |
| 5 | 99696 |
Space Separator
| Value | Count | Frequency (%) |
| 199523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 598569 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 199523 | ||
| 9 | 199523 | |
| 4 | 99827 | |
| 5 | 99696 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 598569 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 199523 | ||
| 9 | 199523 | |
| 4 | 99827 | |
| 5 | 99696 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 1 | 12382 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 199523 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 187141 | |
| 1 | 12382 | 6.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 187141 | |
| 1 | 12382 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 187141 | |
| 1 | 12382 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 199523 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 187141 | |
| 1 | 12382 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 199523 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 187141 | |
| 1 | 12382 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199523 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 187141 | |
| 1 | 12382 | 6.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| age | class_of_worker | industry_code | occupation_code | education | wage_per_hour | enrolled_in_edu_inst_last_wk | marital_status | major_industry_code | major_occupation_code | race | hispanic_Origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | divdends_from_stocks | tax_filer_status | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veterans_admin | veterans_benefits | weeks_worked_in_year | year | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 73 | Not in universe | 0 | 0 | High school graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Other Rel 18+ ever marr not in subfamily | Other relative of householder | 1700.09 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 1 | 58 | Self-employed-not incorporated | 4 | 34 | Some college but no degree | 0 | Not in universe | Divorced | Construction | Precision production craft & repair | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Head of household | South | Arkansas | Householder | Householder | 1053.55 | MSA to MSA | Same county | Same county | No | Yes | 1 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 2 | 18 | Not in universe | 0 | 0 | 10th grade | 0 | High school | Never married | Not in universe or children | Not in universe | Asian or Pacific Islander | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child 18+ never marr Not in a subfamily | Child 18 or older | 991.95 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Vietnam | Vietnam | Vietnam | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 3 | 9 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1758.14 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | 0 |
| 4 | 10 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1069.16 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | 0 |
| 5 | 48 | Private | 40 | 10 | Some college but no degree | 1200 | Not in universe | Married-civilian spouse present | Entertainment | Professional specialty | Amer Indian Aleut or Eskimo | All other | Female | No | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 162.61 | ? | ? | ? | Not in universe under 1 year old | ? | 1 | Not in universe | Philippines | United-States | United-States | Native- Born in the United States | 2 | Not in universe | 2 | 52 | 95 | 0 |
| 6 | 42 | Private | 34 | 3 | Bachelors degree(BA AB BS) | 0 | Not in universe | Married-civilian spouse present | Finance insurance and real estate | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 5178 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 1535.86 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 7 | 28 | Private | 4 | 40 | High school graduate | 0 | Not in universe | Never married | Construction | Handlers equip cleaners etc | White | All other | Female | Not in universe | Job loser - on layoff | Unemployed full-time | 0 | 0 | 0 | Single | Not in universe | Not in universe | Secondary individual | Nonrelative of householder | 898.83 | ? | ? | ? | Not in universe under 1 year old | ? | 4 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 30 | 95 | 0 |
| 8 | 47 | Local government | 43 | 26 | Some college but no degree | 876 | Not in universe | Married-civilian spouse present | Education | Adm support including clerical | White | All other | Female | No | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Spouse of householder | Spouse of householder | 1661.53 | ? | ? | ? | Not in universe under 1 year old | ? | 5 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 95 | 0 |
| 9 | 34 | Private | 4 | 37 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | Construction | Machine operators assmblrs & inspctrs | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 1146.79 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
Last rows
| age | class_of_worker | industry_code | occupation_code | education | wage_per_hour | enrolled_in_edu_inst_last_wk | marital_status | major_industry_code | major_occupation_code | race | hispanic_Origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | divdends_from_stocks | tax_filer_status | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veterans_admin | veterans_benefits | weeks_worked_in_year | year | label | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199513 | 57 | Private | 9 | 37 | 9th grade | 0 | Not in universe | Divorced | Manufacturing-durable goods | Machine operators assmblrs & inspctrs | White | Central or South American | Female | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Single | Not in universe | Not in universe | Householder | Householder | 743.66 | ? | ? | ? | Not in universe under 1 year old | ? | 4 | Not in universe | Dominican-Republic | Dominican-Republic | Dominican-Republic | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 52 | 95 | 0 |
| 199514 | 51 | Private | 33 | 19 | 10th grade | 0 | Not in universe | Widowed | Retail trade | Sales | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | South | North Dakota | Householder | Householder | 1302.34 | NonMSA to nonMSA | Same county | Same county | No | Yes | 6 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 199515 | 87 | Not in universe | 0 | 0 | High school graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 3255.80 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | ? | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 199516 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | South | Utah | Child under 18 of RP of unrel subfamily | Nonrelative of householder | 2733.75 | MSA to MSA | Same county | Same county | No | Yes | 0 | Mother only present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | 0 |
| 199517 | 39 | Private | 43 | 26 | Bachelors degree(BA AB BS) | 0 | Not in universe | Never married | Education | Adm support including clerical | Other | Mexican-American | Male | No | Not in universe | Full-time schedules | 6849 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 908.14 | ? | ? | ? | Not in universe under 1 year old | ? | 6 | Not in universe | Mexico | Mexico | Mexico | Foreign born- Not a citizen of U S | 2 | Not in universe | 2 | 52 | 95 | 0 |
| 199518 | 87 | Not in universe | 0 | 0 | 7th and 8th grade | 0 | Not in universe | Married-civilian spouse present | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Joint both 65+ | Not in universe | Not in universe | Householder | Householder | 955.27 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Canada | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 199519 | 65 | Self-employed-incorporated | 37 | 2 | 11th grade | 0 | Not in universe | Married-civilian spouse present | Business and repair services | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 6418 | 0 | 9 | Joint one under 65 & one 65+ | Not in universe | Not in universe | Householder | Householder | 687.19 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 52 | 94 | 0 |
| 199520 | 47 | Not in universe | 0 | 0 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 157 | Joint both under 65 | Not in universe | Not in universe | Householder | Householder | 1923.03 | ? | ? | ? | Not in universe under 1 year old | ? | 6 | Not in universe | Poland | Poland | Germany | Foreign born- U S citizen by naturalization | 0 | Not in universe | 2 | 52 | 95 | 0 |
| 199521 | 16 | Not in universe | 0 | 0 | 10th grade | 0 | High school | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 4664.87 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 2 | 0 | 95 | 0 |
| 199522 | 32 | Private | 42 | 30 | High school graduate | 0 | Not in universe | Never married | Medical except hospital | Other service | Black | All other | Female | No | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | Not in universe | Not in universe | Nonfamily householder | Householder | 1830.11 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | ? | ? | ? | Foreign born- Not a citizen of U S | 0 | Not in universe | 2 | 52 | 94 | 0 |
Most frequently occurring
| age | class_of_worker | industry_code | occupation_code | education | wage_per_hour | enrolled_in_edu_inst_last_wk | marital_status | major_industry_code | major_occupation_code | race | hispanic_Origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | divdends_from_stocks | tax_filer_status | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veterans_admin | veterans_benefits | weeks_worked_in_year | year | label | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 559 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 2125.99 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 6 |
| 1947 | 11 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1131.62 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | 0 | 6 |
| 104 | 0 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1363.88 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |
| 358 | 2 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1182.42 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |
| 590 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 966.31 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |
| 603 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1220.24 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |
| 627 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1803.03 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 94 | 0 | 5 |
| 881 | 5 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 886.02 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |
| 1433 | 8 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1215.87 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |
| 1453 | 8 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Not in universe | Child <18 never marr not in subfamily | Child under 18 never married | 1979.97 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | United-States | United-States | United-States | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 95 | 0 | 5 |